PREFACE.



The following elementary treatment of the Method of Least
Squares has grown out of my attempts to so present the subject
to students of physics, astronomy, and engineering, that
a working knowledge based upon an appreciation of its principles
might be acquired with a moderate expenditure of time
and labor.

Conceiving that the ultimate warrant for the legitimacy of
the method itself is to be found in the agreement between the 
observed distribution of residuals and the distribution repre- 
sented by the error curve, I have not scrupled to abandon 
altogether the analytical demonstrations of the equation of 
this curve and to present it as an empirical formula, represent- 
ing the generalized experience of observers. The evidence in 
support of a formula of this kind is necessarily cumulative, 
and the few curves which are presented in illustration of the 
law of error are to be considered as samples of the kind of 
evidence which exists in great abundance. By abandoning 
the theoretical demonstrations, the student is freed from the 
embarrassments which are usually encountered at the thresh- 
old of the subject, and which in many cases cause it to appear 
as a mathematical puzzle whose analytical difficulties absorb 
the attention of the tyro to the complete exclusion of the pur- 
poses for which the analysis is conducted. 

I have sought to give prominence to the distinction between 
accidental and systematic errors, and to insist upon the limitations
which result from the difference between these two
classes of error. To illustrate the principles of the text, I 
have made free use of numerical data and have arranged the 
computations in forms which experience has shown to be
convenient for the purpose, with a view to their subsequent 
use by the student as models for his own computations. 

In the preparation of these pages, I have consulted many, 
if not most, of the standard treatises upon the subject, but
my indebtedness for suggestions and methods of treatment 
is principally to 

Faye, Cours d'Astronomie de l'École Polytechnique.

Oppolzer, Lehrbuch der Bahnbestimmung.

Wright, Treatise on the Adjustment of Observations.

G. C. C. 
THE METHOD OF LEAST SQUARES.



§ 1. Problem. To determine the coefficient of linear expansion
of a certain bar of metal its length was determined at
different temperatures by comparison with a standard of known
length. The data furnished by the measures are (Kohlrausch,
Leitfaden der Physik):





Temperature.    Observed Length.

   20° C.          1000.22
   40              1000.65
   50              1000.90
   60              1001.05



It is required to determine from these observations the amount 
of the expansion of the bar per degree Centigrade. 

If $c$ denote the required expansion, and $l_0$ the length of the
bar when its temperature is 0° C., its length, $l$, at any other
temperature, $t$, may be represented by the equation

$$l = l_0 + ct$$

By means of this equation the four observations recorded
above are transformed into the following observation equations:

$$\begin{aligned}
(1)\quad l_0 + 20c &= 1000.22 \\
(2)\quad l_0 + 40c &= 1000.65 \\
(3)\quad l_0 + 50c &= 1000.90 \\
(4)\quad l_0 + 60c &= 1001.05
\end{aligned}$$

Any two of these equations are sufficient to determine the
values of $l_0$ and $c$, but the values derived from different pairs
of equations will be different. Thus we may find from

Equations.        l_0          c

(1) and (2)      999.79     +0.0215
(1) and (4)      999.80     +0.0208
(2) and (3)      999.65     +0.0250
(3) and (4)     1000.15     +0.0150
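The discordance between pairs can be checked directly. The following is a minimal sketch, not part of the original text; the function name `solve_pair` and the dictionary `pair_solutions` are illustrative choices. It solves every pair of the four observation equations for $l_0$ and $c$.

```python
from itertools import combinations

# Observations from § 1: (temperature in degrees C, observed length)
observations = [(20, 1000.22), (40, 1000.65), (50, 1000.90), (60, 1001.05)]

def solve_pair(obs_a, obs_b):
    """Solve l0 + t*c = l for two observations; returns (l0, c)."""
    (t1, l1), (t2, l2) = obs_a, obs_b
    c = (l2 - l1) / (t2 - t1)
    l0 = l1 - c * t1
    return l0, c

# Keys are the equation numbers of the pair, e.g. (1, 2).
pair_solutions = {
    (i + 1, j + 1): solve_pair(observations[i], observations[j])
    for i, j in combinations(range(len(observations)), 2)
}

for (i, j), (l0, c) in pair_solutions.items():
    print(f"({i}) and ({j}): l0 = {l0:.2f}, c = {c:+.4f}")
```

Four observations furnish six pairs; the table above lists four of them.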



We are here presented with a problem of constant recur- 
rence in the investigations and applications of physical science. 
In order to determine the values of certain quantities with a
high degree of precision more measures or observations are 
made than are absolutely necessary, and these observations 
prove to be inconsistent among themselves, so that the result- 
ing values of the unknown quantities depend upon the manner 
in which the data are combined. It is evident that all of the 
values above found for $l_0$ and $c$ cannot be correct, and it is
doubtful if any absolutely correct value can be derived from 
the data ; but it is also apparent that the observations are not 
worthless and that any of the values above derived may be 
considered as approximations, more or less close, to the true 
values of the required quantities. 

If we assume that the relation between the length of a bar 
and its temperature can be expressed by an equation of the 
form employed above, we must suppose that the discordances 
in the results are due to errors in the observations, and the 
problem then becomes : 

To find from the observed data a set of results which shall 
be affected as little as possible by the errors of the data, or in 
more technical language, to find the most probable values of
the unknown quantities.

We may establish in advance of any formal investigation of
this problem certain principles to which its solution must con- 
form. Thus, 

(A) The adopted values of the quantities which are to be 
determined must be based upon all the data available. Only
in exceptional cases, which will be considered hereafter, is it 
proper to omit or reject any observation or any known relation
among the quantities.

(B) The adopted values must satisfy the observation equa- 
tions as nearly as possible. 

§ 2. Errors and Residuals. The expression error of an
observation has been freely used in the preceding section, but
it should be recognized that the amount of this error can
rarely, if ever, be known, since this would imply an exact
knowledge of the unknown quantities. We may, however,
obtain approximate values of these errors from the adopted
values of the quantities which were to be determined. Thus,
if the values $l_0 = 999.79$, $c = +0.0215$ be substituted in equations
(1), these become

(1) 1000.22 = 1000.22 (3) 1000.87 = 1000.90 

(2) 1000.65 = 1000.65 (4) 1001.08 = 1001.05 

The difference between the first and second members of any
one of these equations is called the residual of that equation,
and is approximately the error of the corresponding observation.
The residuals, $v$, which correspond to the several values
of $l_0$ and $c$ derived in § 1 are given below in tabular form.

l_0 =    999.79     999.80     999.65    1000.15
c   =   +0.0215    +0.0208    +0.0250    +0.0150

          -.03       +.06        .00        .00
          +.03        .00       -.10        .00

We may thus, for any assumed values of the unknown quan- 
tities, find a corresponding set of residuals, and the smaller 
these residuals are, the closer is the probable approximation of 
the assumed to the true values (Principle B).

This statement, however, requires an important qualification 
to which we now proceed. 

The errors with which any given series of observations is 
affected may be divided into two classes:

Accidental Errors, or those whose law of recurrence is such
that in the long run they are as often positive as negative and
whose effect upon the mean of a great number of observations
therefore differs but little from zero; and

Systematic Errors, or those which in the given series of 
observations do not thus tend to be eliminated from the mean. 
In the observations considered in § 1, an error of judgment by
which the observer in a given case read the thermometer 0°.1
too high would probably be an accidental error, since it may 
be presumed that in the long run he would read it as often too 
low as too high, but if through a fixed habit of observing, the 
thermometer were always read too high this would be a sys- 
tematic error, and the number of observations might be indefi- 
nitely increased without in the least diminishing its effect. 

If the standard of length with which the bar was compared 
were an erroneous standard (e.g. 0.01 mm. too long), all of the
observations would be affected with a systematic error due to 
this source, and the residuals would furnish no trace of this 
error, since they show only discordances among the observations,
and not errors affecting all alike. The smallness of the
residuals in any case, therefore, furnishes no guaranty that the
observations and the results derived from them have not been 
vitiated by systematic errors. 
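That a constant error leaves the residuals untouched may be illustrated numerically. The sketch below is only an illustration: the offset of 0.01 is a hypothetical error of the standard, and the values of $l_0$ and $c$ are obtained from equations (1) and (2) merely for convenience. All names are illustrative.

```python
def solve_pair(a, b):
    """Solve l0 + t*c = l for two observations; returns (l0, c)."""
    (t1, l1), (t2, l2) = a, b
    c = (l2 - l1) / (t2 - t1)
    return l1 - c * t1, c

def residuals(l0, c, obs):
    """Computed length minus observed length for every observation."""
    return [l0 + c * t - l for t, l in obs]

observed = [(20, 1000.22), (40, 1000.65), (50, 1000.90), (60, 1001.05)]
offset = 0.01  # hypothetical constant error affecting every measured length
biased = [(t, l + offset) for t, l in observed]

l0_a, c_a = solve_pair(observed[0], observed[1])
l0_b, c_b = solve_pair(biased[0], biased[1])
res_a = residuals(l0_a, c_a, observed)
res_b = residuals(l0_b, c_b, biased)

print("shift in l0:", l0_b - l0_a)
print("largest change in any residual:", max(abs(x - y) for x, y in zip(res_a, res_b)))
```

The whole of the offset passes into $l_0$, while the residuals agree to within rounding, so that no inspection of them could reveal it.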

The presence of errors of this class constitutes the greatest 
obstacle to the accurate determination of any set of quantities 
whose values are sought, and the ingenuity and skill of the
observer or experimenter cannot be better employed than in
avoiding or overcoming the effect of such errors. It therefore 
deserves especial notice that systematic errors can often be 
transformed into accidental errors by varying the methods of 
observation or the conditions under which the observations are 
made. Thus the possible systematic error of judgment in 
reading a thermometer, to which allusion was made above, may 
be transformed into an accidental error if several different per- 
sons take part in the observations, since it is hardly probable 
that they will all have a common, persistent error of judgment. 
The error due to using an erroneous standard of length may
be changed into accidental error by employing a number of 
different standards, since it is not probable that these, constructed
at different times and by different makers, will all
have a common error of length. Considerations of this character
serve to illustrate the great practical importance of varying
the methods of determining any quantity whose value is
desired with great precision. Multiplying observations by the
same method and under similar circumstances serves only to
diminish the effect of accidental errors and is useless beyond a 
certain limit, while varying the methods and the circumstances 
under which observations are made tends to eliminate errors 
of both kinds. 

The principles here considered find their appropriate appli- 
cation in the selection of the methods by which any given set 
of unknown quantities is to be determined, but after the obser- 
vations have been made, since they can, in general, furnish but 
little, if any, information in regard to their own systematic 
errors, these must be neglected and the reduction and discus- 
sion of the observations directed toward eliminating the effect 
of the accidental errors. 

§ 3. The Distribution of Residuals. Gauss, a German mathematician,
has shown by a course of analysis based upon the
theory of probabilities that in any given series comprising a
very large number of observations affected with accidental
errors, the number of errors of a given magnitude, $x$, is a function
of that magnitude. Thus, if $x'$ and $x''$ denote any two
errors, and $y'$ and $y''$ the number of observations having the
errors $x'$ and $x''$ respectively, then

$$y' : y'' :: f(x') : f(x'')$$

The analytical expression for $f(x)$ obtained by Gauss is

$$f(x) = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}, \qquad (2)$$

where $e$ = base of the Naperian system of logarithms,
$\pi$ = ratio of the circumference to the diameter of a circle,
$h$ = a number whose value must be derived for each series
of observations, but is constant for all the observations
of that series.
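Equation (2) and the proportion preceding it may be put into a few lines of code. This is a sketch only; the names `error_law` and `expected_ratio` are illustrative and do not occur in the text.

```python
import math

def error_law(x, h):
    """Gauss's law of error, f(x) = h / sqrt(pi) * exp(-h^2 * x^2)."""
    return h / math.sqrt(math.pi) * math.exp(-(h * x) ** 2)

def expected_ratio(x1, x2, h):
    """Ratio y' : y'' of the numbers of errors of magnitudes x1 and x2."""
    return error_law(x1, h) / error_law(x2, h)

# With h = 1, errors of magnitude 0 should outnumber errors of magnitude 1
# in the ratio e : 1.
print(expected_ratio(0.0, 1.0, h=1.0))
```

The function is symmetrical about $x = 0$, so that positive and negative errors of equal magnitude are represented as equally numerous.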
The same expression for $f(x)$ has been derived by other
mathematicians through different courses of analysis, but
against all of these investigations objections of a theoretical
character have been urged. Experience, however, shows that
the actual distribution of residuals does follow this law, not 
with absolute accuracy, but to a remarkable degree of approxi- 
mation. An excellent illustration of this distribution in the 
case of a comparatively small number of observations is 
afforded by a series of 66 determinations of the velocity of
light made at Washington, in the year 1882.* By means of a 
revolving mirror the time required for the passage of a ray of 
sunlight from one terrestrial point to another was measured. 
The mean of the 66 determinations of this time interval was 
24.827 millionths of a second. By subtracting this mean from 
each single determination a series of residuals will be obtained, 
and the number of residuals whose magnitude equals 1, 2, 3, 
etc. units may then be counted. In this way a fair approxi- 
mation to the distribution of residuals represented by Gauss's 
law of error will be found; but as this law purports to represent
the average distribution of a great number of errors, we
shall obtain a better comparison between it and the actual dis- 
tribution by the following device, to which we resort in order 
to increase the number of available residuals : 

Let it be assumed that in any given set of observations the 
number of residuals of magnitude x is proportional to the num- 
ber of residuals occurring between the limits x — a and x + a, 
where a is a quantity which in strictness ought to be an infini- 
tesimal, but which may be made a small finite quantity without 
appreciable error. In the present case we adopt as the unit in 
which the residuals are to be expressed, the thousand-millionth 
part of a second (0.000000001 s), and put a equal to two such
units. Thus, from a series of 66 observations are derived the 
following numbers which represent the distribution of resid- 
uals which might be expected to occur in a much longer series. 

* Velocity of Light in Air and Refracting Media. Bureau of Navigation,
Navy Department, 1885, p. 187.
Residual.               No.    %

Greater than +13.5
Equal to     +13.5
             +12.5
             +11.5
             +10.5
             + 9.5
             + 8.5
             + 7.5
             + 6.5
             + 5.5



The column headed % represents the number of residuals
differing not more than half a unit from the magnitude given
in the first column, expressed as a percentage of the whole
number of residuals. Fig. A furnishes a graphical representation
of this distribution, each percentage in the above table
being represented by a point whose abscissa is the magnitude
of the residual and whose ordinate is the percentage itself.
The curve whose equation is

$$y = \frac{100\,h}{\sqrt{\pi}}\, e^{-h^2 x^2}, \qquad h = 0.158$$

is shown in the same figure, and a simple inspection of the
curve shows that its ordinates represent very approximately
the percentage of residuals of each magnitude. The coefficient
$h$ appears multiplied by the factor 100 in order that the
ordinates may be represented as percentages.
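The comparison of counted percentages with the curve may be sketched as follows. The value $h = 0.158$ is that quoted for Fig. A; the list `sample` is a set of hypothetical residuals, not the Washington series; and the function `percent_per_unit` is one possible reading of the counting device described above (residuals counted between $x - a$ and $x + a$, then reduced to a unit interval), offered as an assumption rather than as the author's exact procedure.

```python
import math

H_FIG_A = 0.158  # value of h quoted in the text for Fig. A

def percent_ordinate(x, h=H_FIG_A):
    """Ordinate of the percentage curve y = 100*h/sqrt(pi) * exp(-h^2 x^2)."""
    return 100.0 * h / math.sqrt(math.pi) * math.exp(-(h * x) ** 2)

def percent_per_unit(residuals, x, a):
    """Percentage of residuals per unit interval, counted within x - a .. x + a."""
    count = sum(1 for v in residuals if x - a <= v <= x + a)
    return 100.0 * count / (len(residuals) * 2.0 * a)

# Hypothetical residuals, in the unit adopted for x (not the Washington series).
sample = [-3.2, -1.1, 0.2, 0.4, 0.9, 1.3, 2.6, 4.9]

print("counted:", percent_per_unit(sample, x=0.5, a=2.0))
print("curve:  ", percent_ordinate(0.5))
```

With so few residuals the counted percentage can only roughly resemble the curve; the agreement improves as the series is lengthened.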

Figs. B, C, D, represent the distribution of residuals in three 
other series of observations of different kinds, made at different
places, by different observers, but all following the same
law. The unit in which the residuals are expressed, unit of $x$,
is stated with each figure, and the unit of y is in every case 
one per cent of the whole number of residuals. 

The equations of the several curves shown in the figures are
almost identical, but the feature to which the student's attention
is called is that the algebraic form of the equation is in
each case

$$y = \frac{100\,h}{\sqrt{\pi}}\, e^{-h^2 x^2}$$

and not that $h$ has approximately the same value in each
curve. The numerical value of $h$ depends upon the unit
adopted for $x$, and these units having been chosen with reference
to a convenient graphical representation of the residuals,
the agreement in the several values of $h$ must be regarded as
purely artificial.

The series of observations represented in Fig. D is known
to be affected with small systematic errors, and it will be 
noted that the distribution of the residuals is more irregular 
in this case than in any of the others. In each of the series 
represented in Figs. A and C, there are two residuals whose 
magnitudes are too great to be represented in the figures ; and 
it is quite generally found that the actual number of very large 
residuals is slightly greater than the number given by the 
error curve. The illustrations here given are typical cases, 
and may serve to exemplify the statement made at the begin- 
ning of this section, that the actual distribution of residuals is 
found to follow Gauss's law of error, and in the following sec- 
tions this law will be assumed as experimentally demonstrated, 
and from it will be derived the method of combining and dis- 
cussing observations. The student will find it an instructive 
exercise to treat in a manner similar to that pursued above
any series of observations to which he may have access, par- 
ticularly his own observations, and thus lend additional weight 
to the experimental evidence which is here presented for his 
consideration. 

§ 4. The Error Curve. From the manner in which the ordinates
of the points plotted in Figs. A, B, C, and D were derived,
it will be apparent that these ordinates represent the number
of residuals falling within certain chosen limits of error. Thus
in Fig. A, 8.9 per cent of all the residuals lie between the
limits 0 and +1, 8.2 per cent between +1 and +2, etc., the
interval within which the residuals are enumerated being in
every case one unit. It is also evident that the number of
residuals falling within any other interval, $\Delta x$, will depend
upon the magnitude of this interval as well as upon the ordinate
corresponding to it, and if $\Delta x$ is taken sufficiently small
the number of residuals will be proportional to the product
$y \cdot \Delta x$. Geometrically considered, this product is the area
included between the axis of $x$, the curve, and the two ordinates
drawn through the extremities of $\Delta x$, and the number of residuals
falling within the limits of $\Delta x$ is therefore proportional
to this area. We may, if we choose, make $\Delta x$ an infinitesimal,
and the area $y \cdot \Delta x$ and the corresponding number of residuals
will then become indefinitely small, but by taking the sum of
all the infinitesimal areas included between the limits $x = a$
and $x = b$, where $a$ and $b$ have any values whatever, we obtain
the area of that part of the curve included between ordinates
drawn at these limits. By a similar process of summation we
obtain the number of residuals lying between $a$ and $b$, and the
number of residuals thus found must be proportional to the
area, since this proportionality is true in every infinitesimal
element included in the area.

In the following table, the function, $A$, represents the area
of that part of the error curve included between ordinates
whose abscissas are 0 and $x$, the argument of the table being
the values of $x$ for the particular error curve in whose equation
$h = 1$; but the area included between 0 and $x$ in the curve
corresponding to any other value of $h$ may be found from the
same table, by using as the argument $hx$ instead of $x$.

The area of that part of the curve lying between the limits
$a$ and $b$ is represented by

$$A = \int_a^b y\,dx = \frac{h}{\sqrt{\pi}} \int_a^b e^{-h^2 x^2}\,dx \qquad (3)$$

Let the variable in this expression be changed by putting
$hx = t$, and the expression becomes

$$A = \frac{1}{\sqrt{\pi}} \int_{ha}^{hb} e^{-t^2}\,dt$$

These expressions for $A$ become identical if $h = 1$; hence, if
the value of $A$ be computed from the second integral for $h = 1$
and tabulated, we may find from this table the value of $A$ corresponding
to any other value of $h$ by changing the limits $a$,
$b$ into $ha$ and $hb$.

A remarkable property of the curve, which will be of use
hereafter, may be readily obtained from the expression here
found for $A$. If we make $a = 0$ and $b = \infty$, the limits of the
integral become 0 and $\infty$ for all values of $h$, hence the area of
that part of the curve included between $x = 0$ and $x = \infty$ is
the same for all values of $h$, i.e., for every series of observations.
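The area between 0 and $x$ can be evaluated without a table, since $\frac{1}{\sqrt{\pi}}\int_0^{hx} e^{-t^2}\,dt$ is one half of the error function of $hx$. The sketch below relies on Python's `math.erf` for this; the function names are illustrative, and the summation of thin strips is included only to mirror the argument of the text.

```python
import math

def area_from_zero(x, h=1.0):
    """Area of the error curve between the ordinates at 0 and x, for precision h."""
    return 0.5 * math.erf(h * x)

def area_by_summation(x, h, steps=20_000):
    """The same area obtained by summing thin strips y * dx (midpoint rule)."""
    dx = x / steps
    total = 0.0
    for i in range(steps):
        mid = (i + 0.5) * dx
        total += h / math.sqrt(math.pi) * math.exp(-(h * mid) ** 2) * dx
    return total

print(area_from_zero(1.0), area_from_zero(2.0))
print(area_by_summation(3.0, h=0.5), area_from_zero(3.0, h=0.5))
# The area from 0 to a very distant ordinate is the same for every h.
print([area_from_zero(1e6, h) for h in (0.158, 1.0, 7.0)])
```

Changing $h$ merely rescales the argument, which is why a single table computed for $h = 1$ serves for every series of observations.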



TABLE I. AREAS OF THE ERROR CURVE.
Limits 0 and hx.

 hx     A     Diff.     hx     A     Diff.     hx      A       Diff.

0.0   0.000    56      1.0   0.421    19      2.0   0.49766     85
0.1    .056    55      1.1    .440    15      2.1    .49851     56
0.2    .111    53      1.2    .455    12      2.2    .49907     36
0.3    .164    50      1.3    .467     9      2.3    .49943     23
0.4    .214    46      1.4    .476     7      2.4    .49966     14
0.5    .260    42      1.5    .483     5      2.5    .49980      9
0.6    .302    37      1.6    .488     4      2.6    .49989      4
0.7    .339    32      1.7    .492     3      2.7    .49993      3
0.8    .371    27      1.8    .495     1      2.8    .49996      2
0.9    .398    23      1.9    .496     2      2.9    .49998      1
1.0    .421            2.0    .498            3.0    .49999
                                               ∞     .50000





If in any series of observations $n_a^b$ denote the number of
residuals whose magnitudes are included between the limits $a$
and $b$, $n$ the whole number of residuals in the series, and $A_a$,
$A_b$ the values of $A$ obtained from the table with the arguments
$ha$, $hb$, then

$$n_a^b = n\,(A_b \mp A_a)$$

since the ratio of $n_a^b$ to $n$ is equal to the ratio of the area of
that part of the error curve which lies between the limits $a$
and $b$, to the area of the whole curve, and this latter area is
seen from the table to be always unity. The $-$ sign in this
equation is to be used when $a$ and $b$ have like signs, and the $+$
when they have unlike signs. If the percentage of residuals
between the limits $a$ and $b$ is required, it may be found by
substituting 100 in place of $n$ as the coefficient of $(A_b \mp A_a)$.

Thus from Fig. A we find for the series of observations there
represented, $h^2 = 0.025$ and $h = 0.158$.

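To find the distribution of residuals between the limits
$-\infty \ldots -5$, $-5 \ldots -2$, $-2 \ldots +1$, $+1 \ldots +4$, $+4 \ldots +\infty$,
we proceed as follows:

[Table: only the arguments ha = -0.790, -0.316, +0.158 survive, together with the column headings "Per cent," "n_a^b," and "Obs." cited in the text below.]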

The numbers in the column "Per cent" may be compared 
with the percentages given on page 7. The column "Obs." 
gives the actual number of residuals which occur in the given 
series between the limits here considered, and these numbers 
should be compared with the column "$n_a^b$."

By the use of this table, the distribution of residuals in any 
series of observations for which the value of h is known may 
be compared with the theoretical distribution much more 
readily than by plotting a curve, and the student should in 
this way examine several series of observations. The method 
of determining $h$ for any given series is contained in § 12.
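The comparison described in this section may be carried out in a few lines. In the sketch below the values $h = 0.158$ and $n = 66$ are those quoted for the series of Fig. A, while the list `limits` is an assumed set of limits chosen for illustration. Taking the area as negative for negative arguments (as `math.erf` does) makes the single difference `signed_area(b) - signed_area(a)` do the work of the rule of signs stated above; the function names are illustrative.

```python
import math

def signed_area(x, h):
    """Area between 0 and x, taken negative for negative x."""
    return 0.5 * math.erf(h * x)

def expected_number(a, b, h, n):
    """Expected number of residuals between the limits a and b (a < b)."""
    return n * (signed_area(b, h) - signed_area(a, h))

h, n = 0.158, 66  # values quoted in the text for the series of Fig. A
limits = [-math.inf, -5.0, -2.0, 1.0, 4.0, math.inf]  # assumed limits

for a, b in zip(limits, limits[1:]):
    print(f"{a:+.0f} to {b:+.0f}: {expected_number(a, b, h, n):5.1f} of {n}")
```

Putting `n = 100` gives the expected percentages instead of the expected numbers.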
§ 5. The Principle of Least Squares. The quantity $h$ which
appears in the equation of the error curve deserves especial
attention. If in the equation

$$y = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}$$

$x$ be put equal to zero, the resulting value of $y$ is $h/\sqrt{\pi}$. This is
the maximum ordinate of the curve, and the value of this
maximum ordinate varies directly as $h$. If those parts of the
curve remote from the axis of $y$ be considered, it will be found
that the larger $h$ is, the smaller are the values of $y$, since when
$x$ is a large quantity $e^{-h^2 x^2}$ diminishes much more rapidly for
increasing values of $h$ than $h$ itself increases. These relations
between $y$ and $h$ correspond exactly to the criteria by which
we estimate the precision of observations. If we compare two
series of observations, I. and II., and find that in series I. the
small errors are relatively more numerous (large values of $y$
for small $x$'s), and the large errors less numerous (small values
of $y$ for large $x$'s), than in series II., we shall without hesitation
call the observations of series I. more precise or accurate
than those of series II.; and if required to assign definite
meanings to the terms "more precise" and "less precise," we
shall find difficulty in defining them in any other manner than
by reference to the magnitude of the residuals. We therefore
adopt as the measure of precision of any series of observations
the value of $h$ in the equation of its error curve; and having
thus defined the term "precision," we are able to state two
principles which are of general application in the discussion
of observations.

Let the data furnished by each observation be expressed in
the form of an observation equation (Equations 1, § 1); then
the best attainable values of the unknown quantities are those
which,

(1) Distribute the residuals in accordance with the law of
error $y = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}$, and which,

(2) Make the value of $h$ in the equation of the resulting
error curve a maximum.

The first of these principles is indeed involved in the second,
since if the residuals are not distributed in accordance
with the law

$$y = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}$$

there can be no value of $h$ to be made a maximum. It is, however,
advantageous to state (1) as a separate principle, since it
affords a test of the presence of systematic errors in the data,
which, though far from being a perfect criterion, is often convenient
and is sometimes the only test available.

To justify the statement of (2), we resort to the following 
considerations: In accordance with (A), § 1, we suppose that all
of the data available are contained in the observation equations,
and, by (B), § 1, we seek to satisfy all of these equations as nearly
as possible. If the observations are free from systematic
error, a supposition which must here be made, since we have 
no means of taking into account the effect of such errors, we 
may obtain by substituting in the observation equations any 
set of values which approximately satisfy them, a correspond- 
ing set of residuals which will be the errors of the observations, 
on the supposition that the substituted values were the true 
values of the unknown quantities. If these residuals are plot- 
ted in an error curve, they will furnish a numerical measure 
of the precision h, assigned to the observations by this set of 
values, and out of all possible sets of values of the unknown
quantities that set which assigns the maximum precision to 
the observations will be entitled to the greatest degree of con- 
fidence ; for if it were otherwise, we should have no reason for 
preferring a set of values which exactly satisfied all of the 
equations to a set which did not satisfy them. 

It is, of course, true that subsequent observations may fur- 
nish a better determination of the unknowns, and that the 
values thus found will not assign to the earlier observations as 
high a degree of precision as did the erroneous values obtained
from these observations alone, but this subsequent determination
is based upon additional evidence, and the problem with
which we are concerned is not to obtain the best possible values
of the unknown quantities, but the best values which can
be derived from the data in our possession. 

Assuming, then, the validity of (2), we proceed to transform 
it into an expression more convenient for practical use, and for 
this purpose we resort to the following property of the error 
curve, which may be approximately verified by actual measure- 
ment from any of the plotted curves, Figs. A, B, C, D. 

If the error curve be divided into a great number of parts
by drawing equidistant ordinates throughout its whole extent,
and the areas of the several parts into which the curve is thus
divided be each multiplied by the square of the abscissa of its
middle point, the sum of all these products will equal $\frac{1}{2h^2}$.

The analytical expression for the process above described is

$$\int_{-\infty}^{+\infty} x^2\, y\, dx \quad\text{or}\quad \frac{h}{\sqrt{\pi}} \int_{-\infty}^{+\infty} x^2 e^{-h^2 x^2}\, dx$$

Put $hx = t$, and this integral becomes

$$\frac{1}{h^2 \sqrt{\pi}} \int_{-\infty}^{+\infty} t^2 e^{-t^2}\, dt = \frac{1}{2h^2}$$

For the method of obtaining the value of the last integral,
see Newcomb's Calculus, Articles 169, 176.
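The property just stated may be verified numerically by carrying out the process literally: the curve is cut into narrow strips, and each strip's area is multiplied by the square of the abscissa of its middle point. The sketch below is an illustration only; the function name, the number of strips, and the choice to stop at a distance $10/h$ from the axis (beyond which the ordinates are negligible) are assumptions.

```python
import math

def second_moment(h, steps=200_000):
    """Sum of (strip area) * (mid-abscissa)^2 over the whole error curve."""
    half_width = 10.0 / h  # ordinates beyond this distance are negligible
    dx = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps):
        x = -half_width + (i + 0.5) * dx
        y = h / math.sqrt(math.pi) * math.exp(-(h * x) ** 2)
        total += x * x * y * dx
    return total

for h in (0.158, 1.0, 2.5):
    print(h, second_moment(h), 1.0 / (2.0 * h * h))
```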

The area of each of the parts into which the curve was
divided is proportional to the number of residuals occurring
between the limiting ordinates of the part; thus, let $A$ denote
the area of the part, $N$ the corresponding number of residuals,
and $n$ and $a$ the whole number of residuals and the whole area
of the curve respectively; then

$$A : N :: a : n$$

but from the table in § 4, $a = 1$, whence

$$A = \frac{N}{n}$$

Since $N$ denotes the number of $x$'s falling within the given
infinitesimal part, $A$, of the curve, $N x^2$ is equal to the sum
of the squares of the $x$'s (residuals) whose magnitudes fall
between the limiting ordinates of $A$, and taking the sum of all
the $A x^2$'s we obtain

$$\sum A x^2 = \frac{1}{n} \sum N x^2$$

i.e., a sum equal to the mean of the squares of all the residuals.
It is customary to represent the sum of the squares of the
residuals by the symbol $[vv]$, $v$ standing for any residual, and
the [ ] denoting the sum of all quantities of the kind written
within them.

Comparing this result with the one obtained above, we have

$$\frac{[vv]}{n} = \frac{1}{2h^2}$$

from which it appears that the relation between $h$ and the
sum of the squares of the residuals is such that when $h$ is
a maximum, $[vv]$ is a minimum, and principle (2) may be
restated as follows:

The most probable values of the unknown quantities are those
which make the sum of the squares of the residuals a minimum.
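The relation between $h$ and the sum of the squares of the residuals may be read as a rule for computing $h$. The sketch below is a direct reading of that relation and is not necessarily the method of § 12; the function names are illustrative, and the two sets of residuals are those belonging to the first and last sets of values in § 2, taken as computed minus observed.

```python
import math

def sum_of_squares(residuals):
    """The quantity [vv]: sum of the squares of the residuals."""
    return sum(v * v for v in residuals)

def precision_from_residuals(residuals):
    """h obtained from [vv] / n = 1 / (2 h^2)."""
    n = len(residuals)
    return math.sqrt(n / (2.0 * sum_of_squares(residuals)))

first_set = [0.00, 0.00, -0.03, 0.03]  # residuals for l0 = 999.79, c = +0.0215
last_set = [0.23, 0.10, 0.00, 0.00]    # residuals for l0 = 1000.15, c = +0.0150

for name, v in (("first", first_set), ("last", last_set)):
    print(name, "[vv] =", sum_of_squares(v), "h =", precision_from_residuals(v))
```

The set with the smaller $[vv]$ assigns the greater precision to the observations, which is the content of the principle stated above.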

From this principle has been derived the name Method of 
Least Squares, which is commonly applied to that body of 
principles which treats of the combination and discussion 
of observed data. 

We have arrived at this principle from a consideration of
that class of cases in which the quantity observed is a function
of two or more unknown quantities whose values are to
be obtained from the observations. This obviously includes
the case of a single quantity, $x$, whose value is directly measured;
and it will be advantageous to apply the principle of
least squares to this case. The observation equations are here
of the simplest possible form,

$$x = m_1, \quad x = m_2, \quad x = m_3, \quad \ldots, \quad x = m_n,$$

where $m$ denotes an observed value of $x$.

If $x_0$ denote any assumed value of $x$, the residuals obtained
by substituting it in these equations will be

$$v_1 = m_1 - x_0, \quad v_2 = m_2 - x_0, \quad v_3 = m_3 - x_0, \quad \ldots$$

and

$$[vv] = (m_1 - x_0)^2 + (m_2 - x_0)^2 + (m_3 - x_0)^2 + \cdots + (m_n - x_0)^2.$$

The value of $x_0$ which will make $[vv]$ a minimum is found
from the condition

$$\frac{d[vv]}{dx_0} = -2\,\{(m_1 - x_0) + (m_2 - x_0) + \cdots + (m_n - x_0)\} = 0$$

but this equation is equivalent to

$$x_0 = \frac{m_1 + m_2 + m_3 + \cdots + m_n}{n}$$

and it thus appears that the universal practice of taking the
arithmetical mean of all the measures of a single quantity as
the best value of that quantity, is a particular case under the
more general method of least squares.
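This result is easily tried upon numbers. In the sketch below the list `measures` is hypothetical and the function name `sum_sq` is illustrative; the sum of the squares of the residuals is computed for the arithmetical mean and for values slightly displaced from it.

```python
def sum_sq(measures, x0):
    """[vv] for an assumed value x0 of the measured quantity."""
    return sum((m - x0) ** 2 for m in measures)

measures = [24.83, 24.79, 24.85, 24.81, 24.86]  # hypothetical measures
mean = sum(measures) / len(measures)

for delta in (-0.05, -0.001, 0.0, 0.001, 0.05):
    print(f"x0 = mean {delta:+.3f}: [vv] = {sum_sq(measures, mean + delta):.6f}")
```

Any displacement from the mean, in either direction, increases $[vv]$ by $n$ times the square of the displacement.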

§ 6. Weights. It frequently happens that the circumstances
under which an observation was made lead the observer to dis- 
trust its accuracy, while other causes give him increased confi- 
dence in another observation. Observations which thus differ 
in quality are said to have different weights, the weight being
a numerical measure of the quality, and these weights should 
be taken into account in combining the observations. 

Let us suppose two series of observations made upon the same 
unknown quantity, in one of which the individual observations 
are of different quality and entitled to different degrees of con- 
fidence, while in the other the observations are all equally good, 
but each of them entitled to less confidence than the poorest 
observation of the first series. By taking the mean of a num- 
ber of observations of this second series, a more reliable value 
of the unknown quantity may be obtained than any single
observation of the series can furnish, and by properly choosing
the number of observations to be included in the mean, a value
entitled to as much confidence as any observation of the first
series may be found. This number of observations of the second
series whose mean is entitled to as much confidence as a
single observation of the first series, is called the weight of the
equivalent observation in the first series; and, obviously, the
better an observation, the greater is its weight. These weights
furnish no information about the absolute precision of the
observations, but express only their relative excellence as compared
with each other; hence, if $p_1$, $p_2$, $p_3$, etc., be the weights
of any observations, $kp_1$, $kp_2$, $kp_3$, etc., where $k$ is any constant,
will express these weights equally well, since it is the ratios
of the weights, and not their absolute values, which are of
importance.

To exhibit the manner in which these weights are to be
employed, let us recur to the data of § 1, and suppose that
those observations were made under such conditions that the
first one has a weight 1, the second 2, the third 3, and the
fourth 4. In accordance with the definition of weights, this is
equivalent to supposing a second series of observations of
uniform excellence, such that the first of the actual observations
can be replaced by one observation of this series, which must
of course be numerically the same as the observation which it
replaces; the second real observation may be replaced by two
numerically equal observations of the second series; the third
by three, etc. Each of these substituted observations will
furnish an equation precisely like those given in § 1, and when
the sum of the squares of the residuals is formed, we shall
obtain

$$[pvv] = v_1^2 + 2v_2^2 + 3v_3^2 + 4v_4^2.$$

The symbol $[pvv]$, which is adopted as an abbreviation for
this expression, is equivalent, numerically, to the sum obtained
by multiplying the square of each actual residual by the weight
of the corresponding observation and adding the products, and
it is evident that this $[pvv]$ bears the same relation to the
substituted observations that $[vv]$ bore to the actual observations
in the case of equal weights, which was considered in the
preceding section. The principle there obtained may therefore
be generalized as follows:

The most probable values of the unknown quantities are those
which make the sum of the weighted squares of the residuals,
$[pvv]$, a minimum.

Let the student show, as in the preceding section, that when
this principle is applied to the case of observations of unequal
weight made upon a single unknown quantity, it gives as the
most probable value of that quantity

$$x_0 = \frac{[pm]}{[p]}.$$
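
The weighted mean, and the two properties of weights stated above, may be illustrated by a short computation. This is a sketch under stated assumptions: the four measures are hypothetical, and only the weights 1, 2, 3, 4 are taken from the text.

```python
def weighted_mean(measures, weights):
    """Return [pm] / [p], the weighted mean of the measures."""
    pm = sum(p * m for p, m in zip(weights, measures))
    return pm / sum(weights)

# Hypothetical measures, combined with the weights 1, 2, 3, 4 used in the text.
measures = [23.0, 18.0, 21.0, 19.0]
weights = [1, 2, 3, 4]

x0 = weighted_mean(measures, weights)

# Replace each observation of weight p by p equal observations of weight 1.
expanded = [m for m, p in zip(measures, weights) for _ in range(p)]
plain_mean = sum(expanded) / len(expanded)

# Multiplying every weight by a constant k leaves the result unchanged.
scaled = weighted_mean(measures, [2.5 * p for p in weights])

print(x0, plain_mean, scaled)
```

The three printed numbers should agree, which is the numerical counterpart of the substitution of equal observations and of the remark that only the ratios of the weights are of importance.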

As an example of the application of weights, we select the
following observations of the time of ending of the transit of
Mercury of May 6, 1878, which were observed by different
observers in the city of Washington. These observers were
provided with telescopes of different sizes and magnifying
powers, and differed among themselves in point of experience
and skill, so that their observed times of last contact are not
entitled to equal confidence. The weights assigned to the
several observations represent the judgment of the computer with
respect to their relative excellence. (Washington Observations,
1876, Appendix II., page 55.)

Observed Time.      p     pm

5h 38m 23s          1     23        [pm] = 318

37 65

   38  10           1     10        [p] = 16
   38  26           3     78

Weighted mean of the observations = 5h 38m 19s.9.

§ 7. Normal Equations. We have now to show how the
principle of least squares is to be applied in determining the
values of a set of unknown quantities, and in order to fix
the ideas as definitely as possible, let it be supposed that there
are three of these quantities, $x$, $y$, $z$, which are connected with
each one of a set of observed quantities, $n$, by the relation

$$ax + by + cz = n$$

where $a$, $b$, and $c$ are numerical coefficients whose values are
different in the several equations but are supposed known in
each equation. From a series of more than three observed
values of $n$, the most probable values of $x$, $y$, $z$, are to be
obtained by means of the relation $[pvv]$ = a minimum.

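It is not to be presumed that these values when found will
exactly satisfy all the equations, and make $[pvv] = 0$, but we
shall find from each equation a residual $v$, so that strictly the
observation equations should be written

$$\begin{aligned}
a_1x + b_1y + c_1z - n_1 &= v_1, & &p_1\\
a_2x + b_2y + c_2z - n_2 &= v_2, & &p_2\\
a_3x + b_3y + c_3z - n_3 &= v_3, & &p_3\\
\text{etc.} \qquad & & &\text{etc.}
\end{aligned}$$

The symbols $p_1$, $p_2$, $p_3$ represent the weights assigned to the
observed values, $n_1$, $n_2$, $n_3$, etc.

By the ordinary rule for determining a minimum of a function
of several variables, the condition $[pvv]$ = a minimum,
furnishes the three equations

$$\frac{d[pvv]}{dx} = 0 \qquad \frac{d[pvv]}{dy} = 0 \qquad \frac{d[pvv]}{dz} = 0$$

and in order to obtain these derivatives we form from the
observation equations

$$\begin{aligned}
[pvv] = {} & \{a_1\sqrt{p_1}\,x + b_1\sqrt{p_1}\,y + c_1\sqrt{p_1}\,z - \sqrt{p_1}\,n_1\}^2\\
& + \{a_2\sqrt{p_2}\,x + b_2\sqrt{p_2}\,y + c_2\sqrt{p_2}\,z - \sqrt{p_2}\,n_2\}^2\\
& + \{a_3\sqrt{p_3}\,x + b_3\sqrt{p_3}\,y + c_3\sqrt{p_3}\,z - \sqrt{p_3}\,n_3\}^2\\
& + \text{etc.}
\end{aligned}$$

The derivative of this expression with respect to $x$ is

$$\begin{aligned}
& 2a_1\sqrt{p_1}\,\{a_1\sqrt{p_1}\,x + b_1\sqrt{p_1}\,y + c_1\sqrt{p_1}\,z - \sqrt{p_1}\,n_1\}\\
& + 2a_2\sqrt{p_2}\,\{a_2\sqrt{p_2}\,x + b_2\sqrt{p_2}\,y + c_2\sqrt{p_2}\,z - \sqrt{p_2}\,n_2\}\\
& + 2a_3\sqrt{p_3}\,\{a_3\sqrt{p_3}\,x + b_3\sqrt{p_3}\,y + c_3\sqrt{p_3}\,z - \sqrt{p_3}\,n_3\}\\
& + \text{etc.} = 0
\end{aligned} \qquad (6)$$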

Let this expression be expanded, divided by 2 and simplified
by the introduction of [ ] to denote the sum of all terms like
those placed within them (all terms standing in the same
vertical column), and it becomes the first of the following group of

Normal Equations.

$$\begin{aligned}
[paa]x + [pab]y + [pac]z - [pan] &= 0\\
[pab]x + [pbb]y + [pbc]z - [pbn] &= 0\\
[pac]x + [pbc]y + [pcc]z - [pcn] &= 0
\end{aligned} \qquad (7)$$

The second and third of these equations are derived in
precisely the same manner as the first from the conditions

$$\frac{d[pvv]}{dy} = 0 \qquad \frac{d[pvv]}{dz} = 0$$

These equations are equal in number to the unknown quantities,
and their solution will in general furnish a determinate
set of values for these quantities which will be the most
probable values, since the normal equations include all of the data
furnished by the observations and have been so derived as to
satisfy the principle of least squares.

Equations (6) furnish a rule which is frequently given for
the formation of normal equations. To obtain the first normal
equation, multiply each observation equation by the product of
its weight into the coefficient of $x$ which occurs in it, and take
the sum of all the resulting equations. The other normals are
similarly obtained from the weights and the coefficients of $y$,
$z$, etc., having due regard to the algebraic signs of the quantities
in the several multiplications and divisions. This method
is occasionally convenient, but in general the method of forming
normal equations given in § 9 will be found less laborious.

The symmetrical manner in which the coefficients of the
normal equations are disposed should be especially noted, since
this considerably diminishes the labor of their formation. The
first coefficient in the second equation is the same as the second
coefficient in the first equation, and generally the $m$th coefficient
in the $n$th equation is the same as the $n$th coefficient in the $m$th
equation.

Let the student form normal equations from the observation
equations contained in § 1, assuming that those equations have
equal weights.
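
The rule for forming the normals may be sketched in a few lines. The function name `normal_equations` and the observation equations used below are hypothetical (they are not the data of § 1); the sign convention is that of this section, $ax + by + \cdots - n = 0$, so that the normals take the form $N\,u = r$ with $r$ holding $[pan]$, $[pbn]$, etc.

```python
def normal_equations(rows, weights):
    """Form the normal equations from weighted observation equations.

    rows    -- one list [a, b, ..., n] per equation  a*x + b*y + ... - n = 0
    weights -- the weight p of each observation equation
    Returns (N, r): N[i][j] holds sums such as [pab], r[i] holds sums such as [pan].
    """
    k = len(rows[0]) - 1                      # number of unknowns
    N = [[0.0] * k for _ in range(k)]
    r = [0.0] * k
    for row, p in zip(rows, weights):
        coeffs, n = row[:k], row[k]
        for i in range(k):
            # Multiply the equation by (weight x coefficient of unknown i) and add.
            r[i] += p * coeffs[i] * n
            for j in range(k):
                N[i][j] += p * coeffs[i] * coeffs[j]
    return N, r

# Hypothetical observation equations in two unknowns, with assumed weights.
rows = [[1, 1, 3.1], [1, 2, 4.9], [1, 3, 7.2], [1, 4, 8.8]]
weights = [1, 2, 1, 2]

N, r = normal_equations(rows, weights)
print(N)   # symmetrical: N[0][1] equals N[1][0]
print(r)
```

Because of the symmetry only the coefficients on and above the diagonal need be computed in practice; the sketch forms all of them for clarity.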

§ 8. Non-Linear Observation Equations. In all of the
preceding investigation, it has been tacitly assumed that the
relation of the observed to the unknown quantities can be
expressed by an equation of the first degree; but cases in
which this relation is of a much more complicated character
are not uncommon, and a method of applying the principle of
least squares to these cases is required. For the sake of
simplicity, this method will be derived for the case of two
unknown quantities, but the process is perfectly general and can
readily be extended to any other number of unknowns.

Let $x$ and $y$ be any two quantities which have not been
directly measured but which are connected with an observed
quantity, $m$, by the relation

$$f(x, y) = m$$

which represents any equation whatever existing between $x$, $y$,
and $m$. Let $x_0$ and $y_0$ denote approximate values of $x$ and $y$,
so that

$$x = x_0 + \Delta x, \qquad y = y_0 + \Delta y,$$

$\Delta x$ and $\Delta y$ being the corrections which must be added to $x_0$ and
$y_0$ in order to obtain the most probable values of $x$ and $y$. We
may, for the present, suppose that $x_0$ and $y_0$ are mere guesses
at the values of $x$ and $y$, and we may test their correctness by
substituting their numerical values in the equation

$$f(x, y) = m$$

which corresponds to each observed value of $m$. If every such
equation were exactly satisfied by these values, we should infer
that $x_0$ and $y_0$ were the most probable values of $x$ and $y$. It
cannot be expected that this perfect agreement will ever be
found in practice, but from each observation equation a residual,
$v$, will be found, due partly to the errors of the observations
and partly to $\Delta x$ and $\Delta y$.

If in the equation $f(x, y) = m$ we substitute for $x$ and $y$
$x_0 + \Delta x$, $y_0 + \Delta y$, and develop the expression by Taylor's
Formula, remembering that $m - f(x_0, y_0)$ is the residual $n$
found by substituting numerical values of $x_0$, $y_0$, in the several
observation equations, we have

$$\frac{dm}{dx}\,\Delta x + \frac{dm}{dy}\,\Delta y - n = 0,$$

and if the abbreviations $a = \dfrac{dm}{dx}$, $b = \dfrac{dm}{dy}$ be introduced into this
equation, it becomes

$$a \cdot \Delta x + b \cdot \Delta y - n = 0.$$

Each observation equation may thus be made to furnish a linear
equation involving $\Delta x$ and $\Delta y$, and these equations may
be treated by the method of § 7. It must, however, be remembered
that in the above development by Taylor's Formula we
have retained only the first three terms of an infinite series,
and if the approximate values $x_0$, $y_0$ are not so nearly the most
probable values that the squares and higher powers of $\Delta x$ and
$\Delta y$ are inappreciable, the development and the solution based
upon it are inaccurate. On this account, it is seldom advantageous
to make a least square solution for the unknown quantities
until very approximate values of them have been found.
These values will usually be obtained from the solution of a
small number of the observation equations.
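
The process just described, repeated until the corrections become inappreciable, may be sketched as follows. Everything specific in the sketch is an assumption made for illustration: the relation $m = x\,e^{yt}$, the error-free "observations" generated from $x = 2$, $y = 0.5$, and the rough starting values. It is not a relation taken from the text.

```python
import math

def model(x, y, t):
    """Hypothetical non-linear relation m = f(x, y) = x * exp(y * t)."""
    return x * math.exp(y * t)

def correction(x0, y0, ts, ms):
    """One least-squares step: form and solve the two normals for (dx, dy)."""
    aa = ab = bb = an = bn = 0.0
    for t, m in zip(ts, ms):
        a = math.exp(y0 * t)             # dm/dx at the approximate values
        b = x0 * t * math.exp(y0 * t)    # dm/dy at the approximate values
        n = m - model(x0, y0, t)         # residual of the approximate values
        aa += a * a
        ab += a * b
        bb += b * b
        an += a * n
        bn += b * n
    # Normals: [aa]dx + [ab]dy = [an],  [ab]dx + [bb]dy = [bn].
    det = aa * bb - ab * ab
    dx = (an * bb - ab * bn) / det
    dy = (aa * bn - ab * an) / det
    return dx, dy

ts = [0.0, 1.0, 2.0, 3.0]
ms = [model(2.0, 0.5, t) for t in ts]    # error-free values, for illustration only

x, y = 1.8, 0.55                          # rough approximations x0, y0
for _ in range(6):
    dx, dy = correction(x, y, ts, ms)
    x, y = x + dx, y + dy
print(x, y)
```

Each pass treats the corrected values as new approximations, which is the remedy for the neglected squares and higher powers of $\Delta x$ and $\Delta y$; with poor starting values the process may fail to converge.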

The transformation of the observation equations by the
introduction of corrections to assumed values of the unknowns
is often advantageous even when the original equations are of
the first degree, especially if the original quantities were of
very different magnitudes. Thus, in the problem of § 1, the
observation equations are of the form

$$M = f(l_0, c) = l_0 + tc$$

in which $c$ is a very small quantity while $l_0$ is approximately
1000. If we put

$$l_0 = 1000 + \Delta l_0 \qquad c = 0 + \Delta c$$

we have

$$\frac{dm}{dl_0} = 1 \qquad \frac{dm}{dc} = t$$

and the equations are transformed into

    Δl_0 + 20Δc - 0.22 = 0
    Δl_0 + 40Δc - 0.65 = 0
    Δl_0 + 60Δc - 0.90 = 0
    Δl_0 + 60Δc - 1.05 = 0

By this transformation the numerical operations involved in
forming and solving the normal equations are much simplified
through the substitution of small numbers in the place of
large ones.

§ 9. Formation and Solution of the Normal Equations. If
the number of unknown quantities is greater than two, and
especially if the number of observations is large, the numerical
computation of a set of normal equations is a laborious process,
and one in which errors are almost certain to occur unless
special precautions are taken to guard against them. The
method of forming these equations presented in this section
has been developed with special reference to facilitating the
numerical operations and obtaining the normals with the least
expenditure of labor consistent with the requisite accuracy,
and although some of the processes may seem at first sight
unnecessary and cumbrous, a little experience in their use, or
in their neglect, will convince the student that they are in
the long run labor-saving devices.

Let each observation equation be written out and arranged
in tabular form, as in the following example. In order that
these equations should furnish a good determination of the
unknowns, $x$, $y$, $z$, it is necessary that the coefficients of these
quantities should present a considerable range in their values
in the several equations. Thus, if all the coefficients of $x$
were alike, all the coefficients of $y$ equal each to each, etc., the
equations would be absolutely indeterminate since we should
have several unknown quantities involved in a single equation
many times repeated, and if the coefficients approximate to
this equality the equations will be approximately indeterminate,
and will furnish unreliable values of the unknowns.
If, therefore, several observations have been made under similar
conditions, and furnish equations which are nearly identical,
these will be nearly equivalent to a repetition of the same
equation and it will be permissible to take their mean, having
regard to their respective weights, and treat it as a single
observation equation with a weight equal to the sum of the
weights of the observations.

Having thus reduced the number of equations as far as
possible, each equation should be multiplied by the square root
of its weight as was done, § 7, in obtaining the form of the
normal equations. By this multiplication the weights will be
completely taken into account and will require no further
attention. Let the weighted equations thus obtained be
represented by

$$\begin{aligned}
a_1x + b_1y + c_1z + n_1 &= 0\\
a_2x + b_2y + c_2z + n_2 &= 0\\
a_3x + b_3y + c_3z + n_3 &= 0\\
\text{etc.} \qquad &
\end{aligned}$$


It will usually facilitate the formation of the normals to so
transform these equations that no number greater than 1 shall
occur in any of them. This can always be done by introducing
new unknown quantities and dividing each equation by some
constant number, usually some power of 10. Thus in the case
of the two equations

$$\begin{aligned}
5x + 71y - 63 &= 0\\
0.9x - 193y + 93 &= 0
\end{aligned}$$

let each equation be divided by 100 and put

$$u = 0.05\,x \qquad w = 1.93\,y$$

the equations are thus transformed into

$$\begin{aligned}
1.000\,u + 0.368\,w - 0.630 &= 0\\
0.180\,u - 1.000\,w + 0.930 &= 0
\end{aligned}$$

The solution of these equations will furnish values of $u$ and $w$
from which $x$ and $y$ may be found by the relations

$$x = \frac{u}{0.05} \qquad y = \frac{w}{1.93}$$

The purpose of this transformation is to simplify the subsequent
numerical work by reducing the numbers involved to
an approximate equality.

Every coefficient which appears in the normal equations is
the sum of a series of products of two quantities, thus

$$\begin{aligned}
[aa] &= a_1a_1 + a_2a_2 + a_3a_3 + \cdots\\
[ab] &= a_1b_1 + a_2b_2 + a_3b_3 + \cdots\\
[bn] &= b_1n_1 + b_2n_2 + b_3n_3 + \cdots
\end{aligned}$$

These products may be formed by the aid of Crelle's multiplication
tables,* supplemented by a table of squares of numbers
for the $[aa]$, $[bb]$, etc. In case Crelle's tables are not
available, the products may be formed by logarithms or much
more rapidly by the following method due to Bessel. Form for
each equation the sums $a + b$, $a + c$, $b + c$, etc., for every pair
of numbers contained in the equation; then since

$$ab = \tfrac{1}{2}\{(a + b)^2 - aa - bb\}$$

we have

$$\begin{aligned}
[ab] &= \tfrac{1}{2}\{[(a + b)^2] - [aa] - [bb]\}\\
[bc] &= \tfrac{1}{2}\{[(b + c)^2] - [bb] - [cc]\}\\
\text{etc.} \qquad &
\end{aligned} \qquad (8)$$

The $[aa]$, $[bb]$, $[cc]$ are coefficients in the normal equations
and must be computed in any case, and the formation of $[ab]$,
$[bc]$, etc., therefore requires for each coefficient only the single
additional quantity $[(a + b)^2]$, $[(b + c)^2]$, and presents the
very great advantage that these quantities can be obtained
from a table of squares, and being all positive numbers no
attention need be paid to the signs after the sums $a + b$, $b + c$,
etc., have been formed.

* Crelle, Rechentafeln, Berlin. These tables give the products of
all numbers up to 1000 × 1000, and are of very general utility.
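
Bessel's device may be verified on a small case. The two columns of coefficients below are hypothetical; the sketch merely confirms that the product sum formed directly agrees with the one formed from squares alone.

```python
# Hypothetical a and b coefficients of three observation equations.
a = [0.8, -0.3, 0.5]
b = [-1.0, 0.4, 0.7]

# Direct formation of [ab] as a sum of products.
direct = sum(x * y for x, y in zip(a, b))

# Bessel's method: only squares are needed once the sums a + b are formed.
sq_sum = sum((x + y) ** 2 for x, y in zip(a, b))   # [(a + b)^2]
aa = sum(x * x for x in a)                         # [aa]
bb = sum(y * y for y in b)                         # [bb]
via_squares = 0.5 * (sq_sum - aa - bb)

print(direct, via_squares)
```

On a modern machine the direct product is of course no more laborious than the squares; the identity is of interest chiefly as a check and for understanding the printed schedules.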

No method of computation can furnish a guaranty against
the commission of numerical errors, and it is therefore desirable
to test the computation from time to time to ascertain if
such errors have occurred. To secure such a test or "check,"
as it is called, we introduce the following auxiliary quantities,
one for each observation equation:

$$\begin{aligned}
s_1 &= a_1 + b_1 + c_1 + \cdots + n_1\\
s_2 &= a_2 + b_2 + c_2 + \cdots + n_2\\
\text{etc.} \qquad &
\end{aligned} \qquad (9)$$

and form the quantities $[as]$, $[bs]$, $\cdots$ $[sn]$. It will appear from
the mode in which the coefficients of the normal equations
are formed that

$$\text{Check} \qquad \begin{aligned}
[aa] + [ab] + [ac] + \cdots + [an] &= [as]\\
[ab] + [bb] + [bc] + \cdots + [bn] &= [bs]\\
\text{etc.} \qquad &
\end{aligned} \qquad (9a)$$

The $[as]$, $[bs]$, etc., are formed in precisely the same manner
as $[ab]$, $[ac]$, etc., and the check relations above given must
be satisfied by the computed values of these quantities.

Where only two unknown quantities are involved in the
normal equations the solution of the equations may be
conveniently made by any of the methods of elementary algebra,
but if the number of unknowns is greater than two, the simple
and elegant method of successive substitutions proposed by
Gauss may be employed with advantage.

The normal equations in the case of three unknown quantities
are:

$$\begin{aligned}
[aa]x + [ab]y + [ac]z + [an] &= 0\\
[ab]x + [bb]y + [bc]z + [bn] &= 0\\
[ac]x + [bc]y + [cc]z + [cn] &= 0
\end{aligned} \qquad (10)$$

and from the first of these

$$x = -\frac{[ab]}{[aa]}\,y - \frac{[ac]}{[aa]}\,z - \frac{[an]}{[aa]}$$

This value of $x$ substituted in the second and third equations
transforms them into

$$\begin{aligned}
[bb\cdot 1]y + [bc\cdot 1]z + [bn\cdot 1] &= 0\\
[bc\cdot 1]y + [cc\cdot 1]z + [cn\cdot 1] &= 0
\end{aligned}$$

in which

$$\begin{aligned}
[bb\cdot 1] &= [bb] - \frac{[ab]}{[aa]}[ab] & [cc\cdot 1] &= [cc] - \frac{[ac]}{[aa]}[ac]\\
[bc\cdot 1] &= [bc] - \frac{[ab]}{[aa]}[ac] & [cn\cdot 1] &= [cn] - \frac{[ac]}{[aa]}[an]\\
[bn\cdot 1] &= [bn] - \frac{[ab]}{[aa]}[an] & &
\end{aligned}$$

These equations constitute a new set of normals, from which
one unknown quantity has been eliminated. The correctness
of the numerical work of this elimination may be tested by a
continuation of the checks used in forming the original normals.
We introduce an auxiliary quantity

$$[bs\cdot 1] = [bb\cdot 1] + [bc\cdot 1] + [bn\cdot 1]$$

and inquire its relation to $[as]$, $[bs]$, etc. If we substitute in
the expression for $[bs\cdot 1]$ the values of $[bb\cdot 1]$, $[bc\cdot 1]$, $[bn\cdot 1]$
in terms of the original coefficients, having regard to the
relations

$$\begin{aligned}
[aa] + [ab] + [ac] + [an] &= [as]\\
[ab] + [bb] + [bc] + [bn] &= [bs]
\end{aligned}$$

we find

$$[bs\cdot 1] = [bs] - \frac{[ab]}{[aa]}[as] \qquad (13)$$

and similarly,

$$[cs\cdot 1] = [bc\cdot 1] + [cc\cdot 1] + [cn\cdot 1] = [cs] - \frac{[ac]}{[aa]}[as]$$

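We may therefore obtain a complete check upon the accuracy
of the numerical work involved in the elimination of $x$, by
forming the quantities $[bs\cdot 1]$, $[cs\cdot 1]$, in the same manner as
$[bb\cdot 1]$, $[bc\cdot 1]$, $[bn\cdot 1]$, etc., and comparing the actual sums
of these latter quantities with the computed check quantities.
By a repetition of the process of elimination we obtain

$$[cc\cdot 2]z + [cn\cdot 2] = 0 \qquad \text{Check } [cs\cdot 2]$$

where

$$\begin{aligned}
[cc\cdot 2] &= [cc\cdot 1] - \frac{[bc\cdot 1]}{[bb\cdot 1]}[bc\cdot 1]\\
[cn\cdot 2] &= [cn\cdot 1] - \frac{[bc\cdot 1]}{[bb\cdot 1]}[bn\cdot 1]\\
[cs\cdot 2] &= [cs\cdot 1] - \frac{[bc\cdot 1]}{[bb\cdot 1]}[bs\cdot 1] = [cc\cdot 2] + [cn\cdot 2]
\end{aligned} \qquad (15)$$

and we are enabled to write the following equivalents for the
original normal equations.

Elimination Equations.

$$\begin{aligned}
x + \frac{[ab]}{[aa]}\,y + \frac{[ac]}{[aa]}\,z + \frac{[an]}{[aa]} &= 0\\
y + \frac{[bc\cdot 1]}{[bb\cdot 1]}\,z + \frac{[bn\cdot 1]}{[bb\cdot 1]} &= 0\\
z + \frac{[cn\cdot 2]}{[cc\cdot 2]} &= 0
\end{aligned} \qquad (16)$$
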

The last of these equations gives the value of $z$ directly, the
second furnishes $y$ as soon as $z$ is known, and the first gives
the value of $x$. The whole solution is therefore reduced to
finding the values of the coefficients and absolute terms in
these elimination equations. A convenient arrangement of the
computation by which these quantities are obtained is given
in the following example, in which the actual computation is
exhibited upon one page, and the opposite page contains a
schedule correspondingly arranged showing the analytical
equivalent of each number contained in the computation.
In making the multiplications of $[ab]$, $[ac]$, $[an]$, $[as]$, by
the constant factor $\dfrac{[ab]}{[aa]}$, the logarithm of this factor is written
on the edge of a slip of paper, and being held successively
adjacent to the logarithms of $[ab]$, $[ac]$, $[an]$, $[as]$, the sum of
the two logarithms is taken mentally, the corresponding number
looked out from a logarithmic table and written in its
proper place under $[bb]$, $[bc]$, $[bn]$, $[bs]$, a subtraction then
gives the value of $[bb\cdot 1]$, $[bc\cdot 1]$, $[bn\cdot 1]$, $[bs\cdot 1]$, and a similar
process is followed for each other derived coefficient.
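
One step of the elimination, with its check column carried along, may be sketched thus. The function name `eliminate_first` and the small system of normals are hypothetical; each row is written as coefficients, absolute term, and check sum, for example $[ab]$, $[bb]$, $[bc]$, $[bn]$, $[bs]$.

```python
def eliminate_first(rows):
    """Eliminate the first unknown from symmetrical normal equations.

    rows -- one list per normal equation: [coefficients..., absolute term, check sum]
    Returns the reduced rows, e.g. [[bb.1, bc.1, bn.1, bs.1], [bc.1, cc.1, cn.1, cs.1]].
    """
    pivot = rows[0]
    reduced = []
    for i in range(1, len(rows)):
        factor = pivot[i] / pivot[0]              # e.g. [ab]/[aa], then [ac]/[aa]
        # Drop the first column and subtract factor times the first row.
        reduced.append([rows[i][j] - factor * pivot[j]
                        for j in range(1, len(pivot))])
    return reduced

# Hypothetical normals; the last entry of each row is the sum of the others.
rows = [
    [4.0, 2.0, 1.0, 1.0, 8.0],
    [2.0, 5.0, 3.0, 2.0, 12.0],
    [1.0, 3.0, 6.0, 3.0, 13.0],
]

reduced = eliminate_first(rows)
for r in reduced:
    # Check: the derived check sum must equal the sum of the derived terms.
    print(r, "sum of terms =", sum(r[:-1]))
```

Applying the same function to `reduced` carries the elimination one step further, and the same check continues to hold.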

§ 10. Example. To illustrate the principles contained in
the preceding sections, and to exhibit in detail the process of
deriving the most probable values of several unknown quantities
which are connected with the observed quantity by a
rather complicated relation, we select from Vol. iii. Part 1 of
the Memoirs of the National Academy of Sciences, page 68, the
following series of experiments made with a 10-gauge Colt
gun, loaded with uniform charges of four drams of powder and
1^ ounces of shot, the shot ranging in fineness from No. 10
up to No. 1 Buck. The purpose of the experiments was to
determine the relation existing between the size (fineness)
of the shot and its average velocity over a range of 30 yards.
The following table contains the results of the experiments,
each velocity being the mean result of from three to six
discharges of the gun. The weight of a pellet of No. 10 shot is
taken as the unit of weight, and the velocities are expressed
in feet per second.

Size.            Weight.    Observed Velocity.



BB.                16           1000

FF.                32           1017

No. 1 Buck         64           1067

By plotting these results in a curve with the weight of the
shot as abscissas, and the observed velocities as ordinates, the
experimenter reached the conclusion that the relation between
the weight $W$ and the velocity $V$, is expressed by an equation
of the form

in which $l$, $m$, $n$, are constants whose values are to be determined
from the observations. It will be found upon trial that
$l_0 = 700$, $m_0 = 0.28$, $n_0 = 0.42$, in connection with the observed
values of $V$ and $W$ will approximately satisfy this equation,
and we therefore adopt these approximate values and proceed
(§ 8) to determine the corrections $\Delta l$, $\Delta m$, $\Delta n$, which when
added to $l_0$, $m_0$, $n_0$ will furnish the most probable values of
$l$, $m$, $n$.

The several differential coefficients of the observation
equation, $V = f(l, m, n, W)$, are

dV^V 

'IV h ^v 

dV /„, „, ^V 
in which M denotes the modulus of the common system of 



logarithms, M= 0.43429. In the factor, cot—, — is the ratio of 

two numbers, and must be construed as representing a certain 
arc expressed in parts of the radius: the corresponding arc 
expressed in degrees is 57''.2958— . 

The form of the observation equation with which we are
here concerned is



and introducing into this equation the numerical values of
$l_0$, $m_0$, $n_0$, $V$, $W$, $M$, we find the following

Observation Equations.

(1)    1.29Δl - 729Δm +   0Δn + 53 = 0
(2)    1.36   - 636   + 104   + 32 = 0
(3)    1.41   - 396   + 164   + 24 = 0
(4)    1.45   - 294   + 172   + 28 = 0
(5)    1.48   - 219   + 170   + 38 = 0
(6)    1.50   - 164   + 159   + 37 = 0
(7)    1.52   - 122   + 142   -  2 = 0


The absolute terms of these equations are residuals obtained
by substituting in the original equation

the assumed values of $l_0$, $m_0$, and $n_0$, and the smallness of these
residuals compared with the values of $V$, shows that the
assumed quantities are approximately correct values of $l$, $m$, $n$.
This computed value of $V$ was used instead of the observed $V$
in deriving the coefficients of the equations. The memoir from
which our data are taken contains no indication of the weights
to be assigned to the several determinations of $V$, and in the
absence of such information they should all be treated as
equally precise and given the weight 1; but for the sake of
illustration a slightly different set of weights indicated above
by $p$ has been assigned to them, and by multiplying each
equation by the square root of its weight we obtain the following
Weighted Observation Equations.

    1.29Δl - 729Δm +   0Δn + 53 = 0
    1.36   - 635   + 104   + 32 = 0
    1.26   - 352   + 137   + 21 = 0
    1.45   - 294   + 172   + 28 = 0
    1.62   - 239   + 185   + 42 = 0
    1.58   - 172   + 167   + 38 = 0
    1.52   - 122   + 142   -  2 = 0

The coefficients and absolute terms in these equations are of
very different magnitudes, and to simplify the subsequent
numerical work we divide each equation through by 100 and
put

$$\Delta l = \frac{x}{0.0162} \qquad \Delta m = \frac{y}{7.29} \qquad \Delta n = \frac{z}{1.85}$$

and introduce $x$, $y$, $z$ into the equations in place of $\Delta l$, $\Delta m$, $\Delta n$.
This step, which is frequently called rendering the equations
homogeneous, furnishes the following

Homogeneous Weighted Observation Equations.

    0.796x - 1.000y + 0.000z + 0.530 = 0      s = + 0.326
    0.839  - 0.733  + 0.563  + 0.320 = 0            0.989
    0.777  - 0.482  + 0.741  + 0.210 = 0            1.246
    0.895  - 0.403  + 0.931  + 0.280 = 0            1.703
    1.000  - 0.327  + 1.000  + 0.420 = 0            2.093
    0.975  - 0.236  + 0.903  + 0.380 = 0            2.022
    0.938  - 0.167  + 0.768  - 0.020 = 0            1.519
The values of $s = a + b + c + n$, which are to be used as a
check in the formation of the normal equations, are derived
from these equations.

The formation of the coefficients of the normal equations by
the use of a table of squares, Bessel's method, is represented in
the following tables:

Sums of the Coefficients.

Equation   a+b     a+c     a+n     a+s     b+c     b+n     b+s     c+n     c+s
   1.     0.204   0.796   1.326   1.122   1.000   0.470   0.674   0.530   0.326
   2.     0.106   1.402   1.159   1.828   0.170   0.413   0.256   0.883   1.552
   3.     0.295   1.518   0.987   2.023   0.259   0.272   0.764   0.951   1.987
   4.     0.492   1.826   1.175   2.598   0.528   0.123   1.300   1.211   2.634
   5.     0.673   2.000   1.420   3.093   0.673   0.093   1.766   1.420   3.093
   6.     0.739   1.878   1.355   2.997   0.667   0.144   1.786   1.283   2.925
   7.     0.771   1.706   0.918   2.457   0.601   0.187   1.352   0.748   2.287


From the sums of the squares contained in the several columns
of this table the coefficients $[ab]$, $[ac]$, etc., are computed
at the foot of the columns by the relations

$$[ab] = \tfrac{1}{2}\{[(a + b)^2] - ([a^2] + [b^2])\}, \text{ etc.}$$

The check quantity $[as]$ is compared with

$$[aa] + [ab] + [ac] + [an]$$

whose value is written immediately under $[as]$, and which
must agree with $[as]$ within two or three units of the last
decimal place. Every coefficient of the normal equations
enters into one or more of these sums, which therefore furnish
a complete test of the accuracy of the work in passing from the
homogeneous observation equations to the normal equations.
We now write the

Normal Equations.

    + 5.576x - 2.861y + 4.480z + 1.875 = 0
    - 2.861x + 2.122y - 1.813z - 1.200 = 0
    + 4.480x - 1.813y + 4.138z + 1.348 = 0
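
The passage from the homogeneous equations to the normals, together with the check by the sums $s$, may be reproduced as in the following sketch. The coefficients are those read from the homogeneous weighted observation equations of this section; the sixth $a$ coefficient is taken as 0.975, an assumption adopted because it agrees with the printed check sum 2.022. The helper name `bracket` is hypothetical.

```python
# Rows (a, b, c, n) of the homogeneous weighted observation equations.
# The value 0.975 in the sixth row is an assumed reading of a damaged figure.
rows = [
    (0.796, -1.000, 0.000,  0.530),
    (0.839, -0.733, 0.563,  0.320),
    (0.777, -0.482, 0.741,  0.210),
    (0.895, -0.403, 0.931,  0.280),
    (1.000, -0.327, 1.000,  0.420),
    (0.975, -0.236, 0.903,  0.380),
    (0.938, -0.167, 0.768, -0.020),
]

def bracket(u, v):
    """Gauss's bracket: the sum of the products of corresponding terms."""
    return sum(p * q for p, q in zip(u, v))

a, b, c, n = (list(col) for col in zip(*rows))
s = [sum(r) for r in rows]                 # check quantity s = a + b + c + n

# Coefficients and absolute terms of the three normal equations.
normals = [
    [bracket(a, a), bracket(a, b), bracket(a, c), bracket(a, n)],
    [bracket(a, b), bracket(b, b), bracket(b, c), bracket(b, n)],
    [bracket(a, c), bracket(b, c), bracket(c, c), bracket(c, n)],
]
checks = [bracket(a, s), bracket(b, s), bracket(c, s)]

for row, chk in zip(normals, checks):
    print([round(v, 3) for v in row], " sum:", round(sum(row), 3), " check:", round(chk, 3))
```

Computed in this way the coefficients should agree with the printed normal equations to within a few units of the third decimal place, the small differences being attributable to the rounding of the tabular squares.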

It may be seen from an inspection of these equations that
the data upon which they are based will not furnish a good
determination of the values of all the unknowns, for if the
first equation be divided by - 2 the quotient will be very like
the second equation, and if it be multiplied by + ^ the product
will be very like the third equation. We proceed, however,
with the solution by Gauss' method, which will furnish the
best results that the data can be made to yield.

Solution of the Normal Equations.

[aa]            [ab]            [ac]            [an]            [as]
log [aa]        log [ab]        log [ac]        log [an]        log [as]

                [bb]            [bc]            [bn]            [bs]
                [ab]/[aa]·[ab]  [ab]/[aa]·[ac]  [ab]/[aa]·[an]  [ab]/[aa]·[as]
                [bb·1]          [bc·1]          [bn·1]          [bs·1]    Check sum.
                log [bb·1]      log [bc·1]      log [bn·1]      log [bs·1]

                                [cc]            [cn]            [cs]
                                [ac]/[aa]·[ac]  [ac]/[aa]·[an]  [ac]/[aa]·[as]
                                [cc·1]          [cn·1]          [cs·1]    Check sum.
                                [bc·1]/[bb·1]·[bc·1]  [bc·1]/[bb·1]·[bn·1]  [bc·1]/[bb·1]·[bs·1]
                                [cc·2]          [cn·2]          [cs·2]    Check sum.
                                log [cc·2]      log [cn·2]

Elimination Equations.

    x + ([ab]/[aa]) y + ([ac]/[aa]) z + [an]/[aa] = 0
    y + ([bc·1]/[bb·1]) z + [bn·1]/[bb·1] = 0
    z + [cn·2]/[cc·2] = 0

The course of the computation after the formation of the elimination
equations is sufficiently indicated upon the opposite page.

Solution of the Normal Equations.

  + 5.576    - 2.861    + 4.480    + 1.875    + 9.070
    0.7464     0.4565n    0.6513     0.2730     0.9576

             + 2.122    - 1.813    - 1.200    - 3.752
             + 1.468    - 2.299    - 0.962    - 4.654
             + 0.654    + 0.486    - 0.238    + 0.902
               9.8156     9.6866     9.3766n    9.9552

                        + 4.138    + 1.348    + 8.153
                        + 3.599    + 1.507    + 7.286
                        + 0.539    - 0.159    + 0.867
                        + 0.361    - 0.177    + 0.670
                        + 0.178    + 0.018    + 0.197
                          9.2504     8.2553

Elimination Equations.

    x - 0.513y + 0.803z + 0.336 = 0      x = - 0.030
             y + 0.743z - 0.364 = 0      y = + 0.439
                      z + 0.101 = 0      z = - 0.101
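
The solution of the three normal equations of this example may be repeated by machine as a check upon the schedule. The function `gauss_solve` is a hypothetical helper written for this illustration; it follows the order of elimination described in § 9, without the logarithms and without interchange of rows.

```python
def gauss_solve(N, n):
    """Solve N*u + n = 0 by successive elimination and back substitution."""
    k = len(n)
    rows = [list(N[i]) + [n[i]] for i in range(k)]
    for p in range(k):                       # eliminate unknown p from later rows
        for i in range(p + 1, k):
            factor = rows[i][p] / rows[p][p]
            rows[i] = [rows[i][j] - factor * rows[p][j] for j in range(k + 1)]
    u = [0.0] * k
    for i in reversed(range(k)):             # last unknown first, then upward
        acc = rows[i][k] + sum(rows[i][j] * u[j] for j in range(i + 1, k))
        u[i] = -acc / rows[i][i]
    return u

# Coefficients and absolute terms of the printed normal equations.
N = [
    [5.576, -2.861, 4.480],
    [-2.861, 2.122, -1.813],
    [4.480, -1.813, 4.138],
]
n = [1.875, -1.200, 1.348]

x, y, z = gauss_solve(N, n)
print(round(x, 3), round(y, 3), round(z, 3))
```

The roots so found should agree with those of the schedule to about two decimal places. Exact agreement in the third place is not to be expected, since the schedule carries only three decimals through a system which, as already remarked, is nearly indeterminate.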



log x        8.4771n      log y                     log z       9.0049n
log .0162    8.2096       log 7.29                  log 1.85    0.2672
Δl           - 1.8        Δm                        Δn          - 0.0547
l_0          700.0        m_0                       n_0         + 0.4200
If with the values of $l$, $m$, $n$ thus obtained the corresponding
velocities be computed by means of the original equation

V .W 

— = sec ' 

the resulting residuals should be smaller than those derived
from the substitution of $l_0$, $m_0$, $n_0$, i.e., the absolute terms of
the observation equations. The following comparison of these
residuals shows a much better representation of the observed
values of $V$, especially if the sums of the squares, $[vv]$, be
compared.

Observed - Computed V.

Weight of Shot        1     2     4     8    16    32    64

f(l_0, m_0, n_0)    -53   -32   -24   -28   -38   -37    +2

f(l, m, n)           -6   +10   +13    +4   -10   -12   +22

Not only are the residuals diminished in magnitude, but
their distribution is much more nearly in agreement with the
law of error.

The values thus obtained for $l$, $m$, $n$ ought not to be
considered the best attainable, since the corrections $\Delta m$, $\Delta n$ are
relatively large fractions of $m_0$ and $n_0$, and it is probable that
the neglected terms containing $\Delta m^2$, $\Delta n^2$, etc., have an appreciable
influence upon the solution. To secure the utmost accuracy
these values of $l$, $m$, $n$ should be treated as new approximations
and another set of corrections $\Delta l$, $\Delta m$, $\Delta n$ derived. This
re-solution is recommended to the student as a valuable exercise.
Let the student also derive from the data of § 1 the most
probable values of $l_0$ and $c$, assigning unequal weights to the
several equations.

§ 11. Conditioned Observations. There is a class of cases in
which the application of the principle of least squares seems
to produce absurd results. Thus if each angle of a plane triangle
be measured many times in order to obtain an accurate
set of values for the angles, the application of the principle
that the $[pvv]$ must be made a minimum will furnish as the
most probable value of each angle the weighted mean of the
measures of that angle, but the sum of these weighted means
will usually differ slightly from 180°, and since the sum of the
angles of every plane triangle must equal 180° it appears that
the most probable values above derived are impossible values.
It must, however, be noted that the method of treatment above
outlined is itself a violation of Principle A, § 1, since the
knowledge that the sum of the angles must equal 180° furnishes a
relation among those angles which may be used and ought to
be used in determining their most probable values; and the
apparent absurdity above found is produced by neglecting this
part of the data.

A relation such as the above which must be exactly satisfied 
by a set of observed quantities is called a rigorous condition, 
the equation by which the relation is expressed is called an 
equation of condition, and observations of such quantities are 
known as conditioned observations. The number of rigorous 
conditions is, of course, always less than the number of un- 
known quantities, since if it were equal to the number of such 
quantities the values of the latter would be determined by the 
conditions alone, independently of any observations. 

In order to develop a convenient method of treating rigorous 
conditions, let x,y,zb% three unknown quantities which are to 
be determined from observation, but whose values are required 
to satisfy the equations of condition 

<l.{x,y,z)=0 ^{x,y,z) = Q 

Let the measurements or observations for the determination of 
the unknown quantities be represented by observation equa- 
tions of the form 

f,(i,m)-0 /,(!,,»)-0 /.(2,5) = 

m, n, and q being the quantities directly measured, and the 
measures for the determination of x being quite independent 
of those for y, z, etc. In accordance with the principles of 
least squares the values of the unknown quantities are to be so 
determined that [pvv] shall be made a minimum in each series
of observations above represented, and therefore the sum of all
the weighted squares of the residuals must also be a minimum.
Owing to the conditions $\phi(x, y, z) = 0$, $\psi(x, y, z) = 0$ it will not
in general be possible to assign to the unknown quantities
values which will give to [pvv] its least possible value, and the
problem becomes one of conditioned or relative minima, i.e.
out of all the sets of values of x, y, z which will exactly satisfy
the equations of condition it is required to find that set which
assigns to [pvv] its least value consistent with those equations.
The method of determining relative minima is as follows
(Jordan, Cours d'Analyse, Vol. I, § 205): Multiply each equation
of condition by an undetermined constant factor, and add
the products to the function which is to be made a minimum.
The derivative of the new function with respect to each unknown
quantity must be placed equal to 0, and the equations
thus formed, together with the equations of condition, will be
just sufficient to determine the unknown quantities and the
constant multipliers. Thus, in the present case, representing the
multipliers by $-2k_1$ and $-2k_2$, we have for the new function

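$$\Omega = [pvv] - 2k_1\,\phi(x, y, z) - 2k_2\,\psi(x, y, z) \tag{17}$$

and

$$\frac{\partial\Omega}{\partial x} = 0 \qquad \frac{\partial\Omega}{\partial y} = 0 \qquad \frac{\partial\Omega}{\partial z} = 0 \tag{18}$$

$$\phi(x, y, z) = 0 \qquad \psi(x, y, z) = 0$$

will determine $k_1$, $k_2$, x, y, and z.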

It was shown in § 7 that in general for three unknown 
quantities, 

$$\frac{\partial [pvv]}{\partial x} = 2[paa]x + 2[pab]y + 2[pac]z + 2[pan]$$

but in the case here considered those observation equations
which contain x do not contain either y or z, and, therefore, the
b and c coefficients in those equations are to be considered
zero, all the products ab, ac, are also zero, and

$$\frac{\partial [pvv]}{\partial x} = 2[paa]x + 2[pan]$$

with similar expressions for the y and z derivatives.
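Denoting for the sake of brevity $\phi(x, y, z)$ and $\psi(x, y, z)$
by $\phi$ and $\psi$ respectively, we obtain by differentiating $\Omega$

$$\begin{aligned}
\tfrac{1}{2}\frac{\partial\Omega}{\partial x} &= [paa]x + [pan] - k_1\frac{\partial\phi}{\partial x} - k_2\frac{\partial\psi}{\partial x} = 0\\
\tfrac{1}{2}\frac{\partial\Omega}{\partial y} &= [pbb]y + [pbn] - k_1\frac{\partial\phi}{\partial y} - k_2\frac{\partial\psi}{\partial y} = 0\\
\tfrac{1}{2}\frac{\partial\Omega}{\partial z} &= [pcc]z + [pcn] - k_1\frac{\partial\phi}{\partial z} - k_2\frac{\partial\psi}{\partial z} = 0
\end{aligned}$$

from which

$$\begin{aligned}
x &= -\frac{[pan]}{[paa]} + \frac{\partial\phi}{\partial x}\frac{k_1}{[paa]} + \frac{\partial\psi}{\partial x}\frac{k_2}{[paa]}\\
y &= -\frac{[pbn]}{[pbb]} + \frac{\partial\phi}{\partial y}\frac{k_1}{[pbb]} + \frac{\partial\psi}{\partial y}\frac{k_2}{[pbb]}\\
z &= -\frac{[pcn]}{[pcc]} + \frac{\partial\phi}{\partial z}\frac{k_1}{[pcc]} + \frac{\partial\psi}{\partial z}\frac{k_2}{[pcc]}
\end{aligned}$$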



These equations determine the values of x, y, z, when $k_1$ and
$k_2$ are known, and it should be observed that the first terms of
the second members of the equations are the values of x, y, z,
which would be obtained by treating the observations as if
these quantities were entirely independent of each other, e.g.
in the case of direct observations of the quantities they are
the weighted means of the observations. If we represent the
values thus obtained by $x_0$, $y_0$, $z_0$, and represent by $v_1$, $v_2$, $v_3$ the
corrections which must be added to these quantities in order
to obtain the most probable values of x, y, z, i.e. put

$$x = x_0 + v_1 \qquad y = y_0 + v_2 \qquad z = z_0 + v_3$$

we shall have

$$\begin{aligned}
v_1 &= \frac{\partial\phi}{\partial x}\frac{k_1}{[paa]} + \frac{\partial\psi}{\partial x}\frac{k_2}{[paa]}\\
v_2 &= \frac{\partial\phi}{\partial y}\frac{k_1}{[pbb]} + \frac{\partial\psi}{\partial y}\frac{k_2}{[pbb]}\\
v_3 &= \frac{\partial\phi}{\partial z}\frac{k_1}{[pcc]} + \frac{\partial\psi}{\partial z}\frac{k_2}{[pcc]}
\end{aligned} \tag{19}$$
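The quantities $k_1$ and $k_2$ are called correlates, and from the
manner in which they were introduced it appears that the
number of correlates is equal to the number of rigorous conditions
to which the observed quantities are subject. To
determine the values of the correlates let $x_0 + v_1$, $y_0 + v_2$, $z_0 + v_3$
be substituted for x, y, z in the equations of condition, and
the equations be developed by Taylor's Formula, giving

$$\phi(x, y, z) = \phi(x_0, y_0, z_0) + \frac{\partial\phi}{\partial x}v_1 + \frac{\partial\phi}{\partial y}v_2 + \frac{\partial\phi}{\partial z}v_3 + \text{etc.} = 0$$

and a similar expression for $\psi(x, y, z)$. Let the values of
$v_1$, $v_2$, $v_3$ in terms of $k_1$ and $k_2$ be substituted in these equations,
and put

$$\begin{aligned}
\alpha_1 &= \frac{1}{\sqrt{[paa]}}\frac{\partial\phi}{\partial x} & \alpha_2 &= \frac{1}{\sqrt{[pbb]}}\frac{\partial\phi}{\partial y} & \alpha_3 &= \frac{1}{\sqrt{[pcc]}}\frac{\partial\phi}{\partial z}\\
\beta_1 &= \frac{1}{\sqrt{[paa]}}\frac{\partial\psi}{\partial x} & \beta_2 &= \frac{1}{\sqrt{[pbb]}}\frac{\partial\psi}{\partial y} & \beta_3 &= \frac{1}{\sqrt{[pcc]}}\frac{\partial\psi}{\partial z}
\end{aligned} \tag{20}$$

and the equations become

$$\begin{aligned}
[\alpha\alpha]k_1 + [\alpha\beta]k_2 + \phi(x_0, y_0, z_0) &= 0\\
[\alpha\beta]k_1 + [\beta\beta]k_2 + \psi(x_0, y_0, z_0) &= 0
\end{aligned} \tag{21}$$

from which the values of $k_1$ and $k_2$ may be obtained, and thus
the values of $v_1$, $v_2$, $v_3$ from equations (19), which may now be
put in the following more convenient form:

$$\begin{aligned}
\sqrt{[paa]}\,v_1 &= \alpha_1 k_1 + \beta_1 k_2\\
\sqrt{[pbb]}\,v_2 &= \alpha_2 k_1 + \beta_2 k_2\\
\sqrt{[pcc]}\,v_3 &= \alpha_3 k_1 + \beta_3 k_2
\end{aligned}$$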

The method by which the above equations have been derived 
for the case of three unknown quantities connected by two 
equations of condition is perfectly general and may be ex- 
tended to any other number of quantities whose values are 
to be obtained from independent observations. In the cases 
which actually arise in practice the observation equations and 
equations of condition are usually of simple form, the differential
coefficients and the quantities a, b, c, etc., being usually
equal to either 1 or 0.






PROBLEM. Let the student show by the method of correlates
that if the sum of the measured angles of a plane triangle
exceed 180° by a quantity e, the angles must be corrected by
distributing e among them in such a manner that the correction
to each angle is inversely proportional to the weight of the
angle.
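The problem may be checked numerically. The following is a minimal sketch, assuming direct measures of the three angles and the single condition that their sum equal 180°; the function name `adjust_triangle` and the sample measures and weights are illustrative and do not come from the text.

```python
def adjust_triangle(angles, weights):
    """Adjust three measured angles (degrees) so that they sum to 180.

    With one rigorous condition there is one correlate k. Every partial
    derivative of the condition equals 1, so each correction is k / p.
    """
    excess = sum(angles) - 180.0                 # phi(x0, y0, z0)
    k = -excess / sum(1.0 / p for p in weights)  # [alpha alpha] k + phi = 0
    corrections = [k / p for p in weights]
    adjusted = [a + v for a, v in zip(angles, corrections)]
    return adjusted, corrections


# Hypothetical measures whose sum exceeds 180 degrees by 6 seconds of arc.
angles = [60.0 + 2.0 / 3600, 70.0 + 3.0 / 3600, 50.0 + 1.0 / 3600]
weights = [1.0, 2.0, 3.0]
adjusted, corrections = adjust_triangle(angles, weights)
print(sum(adjusted))
print([round(v * 3600, 3) for v in corrections])  # corrections in seconds of arc
```

The product of each correction by its weight is the same for all three angles, which is the inverse proportionality the problem asks the student to prove.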



To illustrate the application of the principles of the present
section to a numerical problem, we select from the U. S. C. &
G. Survey Report for 1884, pages 409 et seq., the following
telegraphic determinations of longitude, and seek to adjust them
so that they shall be mutually consistent. Each difference of
longitude between two stations was directly observed, so that
the observation equations are all of the form $x = m_1$, $x = m_2$,
etc., and the values given below are the weighted means of the
individual observations of each series. The probable error of
each determination (see § 12) is placed immediately after the
quantity itself, and the weights of the determinations are
assumed to be inversely proportional to the squares of the
probable errors.



Stations.                                     Observed Longitude.

Cambridge, Mass., and Washington, D.C.        23m 41.041s ± 0.018s
Cambridge, Mass., and Cleveland, O.           42  14.875  ± 0.038
Cambridge, Mass., and Columbus, O.            47  27.713  ± 0.035
Washington, D.C., and Columbus, O.            23  46.816  ± 0.038
Cleveland, O., and Columbus, O.                5  12.929  ± 0.045



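The five observed differences of longitude give rise to two
rigorous conditions represented by the following equations of
condition, in which x, y, z, u, w denote the five differences in
the order given above:

$$\phi = x + u - z = 0 \qquad \psi = y + w - z = 0$$

The coefficients in the observation equations being all equal to
unity,

$$[paa] = p_1 \qquad [pbb] = p_2 \qquad \text{etc.,}$$

$$\alpha_1 = \frac{1}{\sqrt{p_1}}\frac{\partial\phi}{\partial x} \qquad \alpha_2 = \frac{1}{\sqrt{p_2}}\frac{\partial\phi}{\partial y} \qquad \beta_1 = \frac{1}{\sqrt{p_1}}\frac{\partial\psi}{\partial x} \qquad \text{etc.,}$$

and from these expressions are derived the following values of
the coefficients, together with the sums $\sigma_1 = \alpha_1 + \beta_1$, $\sigma_2 = \alpha_2 + \beta_2$,
etc., which are to be employed as a check upon the formation
of the normal equations for determining the correlates.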

Coefficients.

          1.       2.       3.       4.       5.

α       +0.18     0.00    -0.35    +0.38     0.00
β        0.00    +0.38    -0.35     0.00    +0.45
σ       +0.18    +0.38    -0.70    +0.38    +0.45



FORMATION OF THE CORRELATE EQUATIONS.

            αα         αβ         ασ         ββ         βσ

         +0.0324    +0.0000    +0.0324    +0.0000    +0.0000
           .0000      .0000      .0000      .1444      .1444
           .1225      .1225      .2450      .1225      .2450
           .1444      .0000      .1444      .0000      .0000
           .0000      .0000      .0000      .2025      .2025

         +0.2993    +0.1225    +0.4218    +0.4694    +0.5919
Check.                          0.4218                0.5919



Correlate Normal Equations.

$$\begin{aligned}
+0.2993\,k_1 + 0.1225\,k_2 + 0.144 &= 0 & k_1 &= -0.449\\
+0.1225\,k_1 + 0.4694\,k_2 + 0.091 &= 0 & k_2 &= -0.078
\end{aligned}$$
The absolute terms of the correlate equations are obtained
by substituting the observed values $x_0$, $y_0$, $z_0$, $u_0$, $w_0$ in the
equations of condition, and the values of $k_1$, $k_2$ may be found from
the correlate equations, either by Gauss's method of substitution
or by any of the ordinary algebraic processes of elimination.
The corrections to $x_0$, $y_0$, $z_0$, etc., and the adopted values
of the unknown quantities, are now found from

$$\begin{aligned}
v_1 &= +0.032\,k_1 + 0.000\,k_2 = -0^s.014 & x &= 23^m\ 41^s.027\\
v_2 &= +0.000\,k_1 + 0.144\,k_2 = -0.011 & y &= 42\ \ 14.864\\
v_3 &= -0.122\,k_1 - 0.122\,k_2 = +0.064 & z &= 47\ \ 27.777\\
v_4 &= +0.144\,k_1 + 0.000\,k_2 = -0.065 & u &= 23\ \ 46.751\\
v_5 &= +0.000\,k_1 + 0.202\,k_2 = -0.016 & w &= \ 5\ \ 12.913
\end{aligned}$$

The values thus obtained satisfy the rigorous conditions of 
the problem, and are the most probable values which can be 
obtained from the data given above. 
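The whole adjustment may be repeated by machine as a check. This is a sketch under stated assumptions rather than the computer's own procedure: the two conditions are taken as x + u − z = 0 and y + w − z = 0, the observed u and w are taken as the adopted values less their corrections, and the weights are scaled so that 1/√p = 10 r, which reproduces the tabulated coefficients. The function name is illustrative.

```python
# Observed differences of longitude in seconds of time, with probable errors.
# Order: x Cambridge-Washington, y Cambridge-Cleveland, z Cambridge-Columbus,
#        u Washington-Columbus,  w Cleveland-Columbus.
# Assumption: observed u and w are the adopted values minus their corrections.
observed = [23 * 60 + 41.041, 42 * 60 + 14.875, 47 * 60 + 27.713,
            23 * 60 + 46.816, 5 * 60 + 12.929]
prob_err = [0.018, 0.038, 0.035, 0.038, 0.045]

# Partial derivatives of the conditions x + u - z = 0 and y + w - z = 0.
phi = [1, 0, -1, 1, 0]
psi = [0, 1, -1, 0, 1]


def adjust_by_correlates(observed, prob_err, phi, psi):
    """Adjust directly observed quantities subject to two linear conditions."""
    s = [10.0 * r for r in prob_err]            # assumed scale: 1/sqrt(p) = 10 r
    alpha = [a * si for a, si in zip(phi, s)]
    beta = [b * si for b, si in zip(psi, s)]
    aa = sum(a * a for a in alpha)
    ab = sum(a * b for a, b in zip(alpha, beta))
    bb = sum(b * b for b in beta)
    phi0 = sum(a * x for a, x in zip(phi, observed))   # absolute terms
    psi0 = sum(b * x for b, x in zip(psi, observed))
    det = aa * bb - ab * ab
    k1 = (-phi0 * bb + psi0 * ab) / det         # correlate normal equations
    k2 = (-psi0 * aa + phi0 * ab) / det
    v = [(a * k1 + b * k2) * si for a, b, si in zip(alpha, beta, s)]
    adjusted = [x + vi for x, vi in zip(observed, v)]
    return k1, k2, v, adjusted


k1, k2, v, adjusted = adjust_by_correlates(observed, prob_err, phi, psi)
print(round(k1, 3), round(k2, 3))
print([round(vi, 3) for vi in v])
```

The correlates and corrections agree with the tabulated ones to within the rounding of a three-place hand computation, and the adjusted values satisfy both conditions.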

§ 12. The Probable Error. Every intelligent observer 
desires to know something of the quality of his observations, 
how good or how bad they are ; the computer who has to com- 
bine the results of different series of observations should have 
some knowledge of their relative accuracy in order to assign 
to each series its proper weight ; and the investigator engaged 
in a complicated series of experiments desires some criterion 
by which to estimate the relative errors of the several parts 
of his work, in order to properly apportion his care among 
them, giving the maximum attention where the greatest errors 
are to be feared. It is evident from the nature of the case 
that no absolute criterion of this kind can be furnished, since 
any series of observations may be affected with systematic 
errors which seriously impair the accuracy of its results but 
furnish no indication of their presence. Both observer and 
computer do, however, estimate the accuracy of observations 
by their agreement among themselves, and that within certain 
limits this procedure is correct follows from Gauss's law of
error. If we suppose a very long series of observations
affected only by accidental errors, the values of the unknown
quantities obtained from the series will differ but little from
the true values (if the series is infinitely long they will be
the true values), the residuals which they furnish will be very
nearly the errors of observation, and the value of h in the
equation of the error curve will furnish a measure of the
precision of the observations as well as a measure of the
smallness of the residuals. On the other hand, if the student
attempts to construct the error curve corresponding to any
short series of residuals, e.g., those of § 10, he will find that
while they give him some information in regard to the curve
there will be much that is arbitrary in its actual construction,
and that many curves can be drawn which will appear to fit
the residuals equally well, i.e. the amount of data in this case
is insufficient to determine more than a rough approximation
to the measure of precision of the observations. If the observations
are affected with systematic errors, the residuals may
be very different from the errors of the observations, and will
then furnish no indication of their accuracy.

It thus appears that any conclusions in regard to the accuracy
of a given set of observations must be treated with
caution if they are based solely on the residuals furnished by 
the observations. Such conclusions are, in fact, valid only 
within certain limits whose general nature is indicated above, 
but within these limits the information thus furnished may be 
of much value, and it is frequently employed for the purposes 
indicated at the beginning of this section. 

The measure of precision, h, seems to be indicated by its
name as the appropriate means of expressing the average
accuracy of a set of observations, but in practice it is not so
used, another function of the residuals being found more convenient.
If in a very long series of observations the residuals
be arranged in the order of their numerical magnitude (without
regard to sign), that residual which occupies the middle
place in the series will have as many residuals greater than it
as there are less than it, and in any future series of observations
of the same degree of precision as that here considered,
it will be an even chance that any given residual will be
greater than, or less than, the middle one above selected. This
middle residual is usually denoted by r, and is rather inappropriately
called the probable error of the series, the adjective
having reference to the equal probabilities of the occurrence of
residuals (errors) greater than, or less than, r.

It is apparent that the greater the precision of any set of
observations, the smaller will be the corresponding probable
error, but the exact relation which exists between h and r
must be derived from the equation of the error curve. The
symmetry of this curve with respect to the axis of y shows
that the same law of distribution holds for both positive and
negative errors, and that in a very long series of residuals the
probable error r will occupy the middle place among the
positive errors and among the negative errors considered
separately, as well as among all the errors taken without regard
to sign. Since we are concerned only with the numerical
magnitude of r we may confine our attention to the positive
residuals, and find the relation between r and h from that half
of the error curve which lies to the right of the axis of y.

Since the probable error is a residual, it must be represented 
by the abscissa of some point on the axis of x, and we may 
determine this point from the condition that the ordinate 
drawn through it bisects the area of that half of the curve 
under consideration, since (from the relation between areas 
and the number of residuals of a given magnitude developed 
in § 4) this is the geometrical equivalent of the statement
that the number of residuals greater than r is equal to the
number less than r. By interpolation from the table in § 4,
the value of the argument corresponding to A = 0.25 is found
to be hx = hr = 0.477, whence the relation between the probable
error and the measure of precision is

$$r = \frac{0.477}{h} \tag{22}$$
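The constant 0.477 need not be taken on trust from a table. A minimal sketch, assuming the error curve has the form $y = (h/\sqrt{\pi})\,e^{-h^2x^2}$, so that the area between 0 and x is half the error function of hx: the argument t = hr is then the root of erf(t) = 1/2, found here by bisection. The function name is illustrative.

```python
import math


def probable_error_factor(tol=1e-12):
    """Solve erf(t) = 1/2 by bisection, where t = h * r."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if math.erf(mid) < 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


rho = probable_error_factor()
print(round(rho, 4))                       # 0.4769
print(round(rho * math.sqrt(2.0), 4))      # 0.6745
print(round(rho * math.sqrt(math.pi), 4))  # 0.8453
```

The last two products are the coefficients commonly rounded to 0.674 and 0.845 in the expressions for the probable error in terms of the squares and of the first powers of the residuals.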



The student will observe that in the definition of the probable
error reference is made to a very long series of observations,
and in a series of infinite length the value of r might be found
immediately from its definition, but in any ordinary set of
observations it is better to assume that the residuals are distributed
in accordance with the law of error, and to determine
the value of r from the relation between h and the sum of the
squares of the residuals, § 5, which gives

$$r = \pm 0.477\sqrt{\frac{2[vv]}{n}}$$

We here encounter a difficulty arising from the attempt to apply 
to a short series of residuals principles which are rigorously true
only when the series is of infinite length. Suppose the above
expression for r applied to a series of three observations involving
three unknown quantities whose values are derived from
the resulting observation equations. These values will exactly
satisfy the equations, no matter what the errors of the observations
may be, and the residuals being all zero, there will be
found r = 0 and h = ∞, which is absurd. The observations in
this case furnish no data from which to estimate their precision,
and in every such case where the number of observations
is equal to the number of unknown quantities, the
expression for the probable error ought to become indeterminate,
$\tfrac{0}{0}$. It is therefore customary to put

$$r = \pm 0.674\sqrt{\frac{[vv]}{n - \mu}} \tag{23}$$

in which $\mu$ denotes the number of quantities whose values have
been derived from the observations. This equation, which is
known as Bessel's expression for the probable error of a single
observation, being only an approximate one, we may usually
put $\tfrac{2}{3}$ in place of the coefficient 0.674. Among German physicists
and astronomers, it is quite customary to suppress this
coefficient altogether, and to use the "mean error"

$$\epsilon = \pm\sqrt{\frac{[vv]}{n - \mu}} \tag{24}$$

for the comparison of observations. Geometrically considered,
$\epsilon$ denotes the abscissa of the point of inflexion of the error
curve.
A simpler expression for the probable error may be obtained
by substituting in the equation

$$r = \frac{0.477}{h}$$

a value of h derived as follows: Let each member of the equation
of the error curve be multiplied by x dx and integrated
between the limits $-\infty$ and $+\infty$, giving

$$\int_{-\infty}^{+\infty} xy\,dx = \frac{h}{\sqrt{\pi}}\int_{-\infty}^{+\infty} x\,e^{-h^2x^2}\,dx$$

The value of the first integral in this equation is obviously 0,
since as we pass along the error curve from $-\infty$ to $+\infty$ every
value of y occurs once associated with a negative value of x,
and again with a numerically equal positive value, and for
every negative element xy dx in the integral there occurs an
equal positive xy dx so that the entire sum is 0. If, however,
we agree to neglect the sign of x and to consider only its
numerical value, we shall find

$$\int_{-\infty}^{+\infty} xy\,dx = 2\int_{0}^{\infty} xy\,dx$$

and by a course of reasoning precisely similar to that applied
in § 5 to the quantity $\int x^2y\,dx$, it may be shown that $2\int_0^\infty xy\,dx$
is equal to the mean of all the residuals taken without regard
to sign. We may therefore write

$$\frac{[+v]}{n} = \frac{2h}{\sqrt{\pi}}\int_{0}^{\infty} x\,e^{-h^2x^2}\,dx$$

where the + inside the brackets denotes that all of the residuals
are to be treated as positive quantities. Putting hx = t
in the second member of this equation and remembering
that here also we are concerned only with numerical values
of x without regard to sign, we obtain

$$\frac{[+v]}{n} = \frac{2}{h\sqrt{\pi}}\int_{0}^{\infty} t\,e^{-t^2}\,dt = -\frac{1}{h\sqrt{\pi}}\Big[e^{-t^2}\Big]_{0}^{\infty}$$

Introducing the limits into the integrated expression there
results

$$\frac{[+v]}{n} = \frac{1}{h\sqrt{\pi}}$$

and

$$r = 0.477\sqrt{\pi}\,\frac{[+v]}{n} \tag{25}$$



This formula is rigorously correct only when the number of
observations is infinite, and it must be transformed so as to
become indeterminate when the number of observation equations
is just sufficient to determine the unknown quantities,
i.e., when $n = \mu$. This might be accomplished by writing
$n - \mu$ in place of n, as was done in equation (23), but it is
customary to substitute in this case $\sqrt{n(n - \mu)}$, which also
renders r indeterminate when $n = \mu$, and gives values of r more
nearly in agreement with equation (23). Making this substitution,
we have

$$r = \pm 0.845\,\frac{[+v]}{\sqrt{n(n - \mu)}} \tag{26}$$

which, when $\mu = 1$, is known as Peters' formula for probable errors.
This formula is very convenient for the numerical computation
of probable errors, but where the number of observations
is small the results furnished by equation (23) are considered
more reliable; neither formula, however, can furnish a good
determination of probable errors from a small number of observations.

The numerical application of these formulae may be illustrated
by the following short series of sextant observations for
the determination of latitude.



Observation.

43° 4' 46" 
4 24 



losHLr] 



2-403 
8.940-10 

0.926-10 



11 = 43 4 27     [+v] = 263     [vv] = 7703



± 18".C 
] 3.88C 



log-^M 1 




The difference between the values of r found from the first
and second powers of the residuals is small compared with the
uncertainty of each arising from the small number of observations.

In so far as these observations can be considered as furnishing
a value of r, they indicate that in a future series of similar
and equally precise observations, there should be as many
observations furnishing residuals (errors) greater than 18" as
there are observations giving residuals less than 18". The ±
which is commonly prefixed to the numerical value of r, denotes
that the observed quantity is as apt to err in excess as in
defect.
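The two determinations of r, from the squares and from the first powers of the residuals, may be compared on any short series. The sketch below assumes the forms $r = 0.6745\sqrt{[vv]/(n-\mu)}$ and $r = 0.8453\,[+v]/\sqrt{n(n-\mu)}$; the residuals are hypothetical and are not the sextant series of the text, and the function names are illustrative.

```python
import math


def mean_error(residuals, unknowns=1):
    """sqrt([vv] / (n - mu)); undefined when n does not exceed mu."""
    n = len(residuals)
    if n <= unknowns:
        raise ValueError("indeterminate: n must exceed the number of unknowns")
    return math.sqrt(sum(v * v for v in residuals) / (n - unknowns))


def probable_error_bessel(residuals, unknowns=1):
    """Probable error of one observation from the squares of the residuals."""
    return 0.6745 * mean_error(residuals, unknowns)


def probable_error_peters(residuals, unknowns=1):
    """Probable error of one observation from the first powers of the residuals."""
    n = len(residuals)
    if n <= unknowns:
        raise ValueError("indeterminate: n must exceed the number of unknowns")
    total = sum(abs(v) for v in residuals)
    return 0.8453 * total / math.sqrt(n * (n - unknowns))


# Hypothetical residuals from a mean, in seconds of arc (they sum to zero).
residuals = [19.0, -3.0, -25.0, 12.0, 30.0, -8.0, -14.0, 5.0, -21.0, 5.0]
r_bessel = probable_error_bessel(residuals)
r_peters = probable_error_peters(residuals)
print(round(r_bessel, 2), round(r_peters, 2))
```

Both functions refuse to return a value when the number of observations equals the number of unknown quantities, which is the indeterminate case described above.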

Let the student derive from the residuals given in § 10 a
determination of the probable error of an observed V, noting
that in this case $\mu = 3$.

§ 13. Probable Error of a Function of Observed Quantities. 

Let $x'$, $x''$, $x'''$ denote quantities which have been determined
from observation, and let $r'$, $r''$, $r'''$ be their probable errors.
Let u be a quantity whose value has been computed from the
values of $x'$, $x''$, $x'''$ by means of the relation

$$u = f(x', x'', x''')$$

It is evident that the precision with which u is determined
depends upon the precision of $x'$, $x''$, $x'''$, and by a slight extension
of the term "probable error" we may consider the precision
of u to be represented by a probable error, r, and may
inquire the relation of r to $r'$, $r''$, $r'''$.

Since a probable error is one of the residuals or errors of a
very long series, we may obtain the desired relation between
r, $r'$, $r''$, $r'''$ from a consideration of the general relation of any
set of errors $v'$, $v''$, $v'''$, in $x'$, $x''$, $x'''$, to the corresponding
error, v, in u. This relation is

$$v = \frac{\partial u}{\partial x'}v' + \frac{\partial u}{\partial x''}v'' + \frac{\partial u}{\partial x'''}v'''$$

(see § 8). To avoid the necessity for considering the signs of
$v'$, $v''$, $v'''$, let this equation be squared, giving

$$v^2 = \left(\frac{\partial u}{\partial x'}\right)^2 v'^2 + \left(\frac{\partial u}{\partial x''}\right)^2 v''^2 + \left(\frac{\partial u}{\partial x'''}\right)^2 v'''^2$$

from which all terms involving the products $v'v''$, $v'v'''$, $v''v'''$,
etc., have been dropped for the reason that the probable error
of u depends upon the average magnitude of v, and in the long
run any pair of residuals $v'$, $v''$ will have opposite signs as
often as they have like signs, and will therefore produce an
equal number of positive and negative terms whose effect upon
the mean value of $v^2$ will be very small compared with the
terms containing $v'^2$, $v''^2$, $v'''^2$, which are always positive.
Replacing these actual errors by the corresponding probable
errors, we obtain

$$r^2 = \left(\frac{\partial u}{\partial x'}\right)^2 r'^2 + \left(\frac{\partial u}{\partial x''}\right)^2 r''^2 + \left(\frac{\partial u}{\partial x'''}\right)^2 r'''^2 \tag{27}$$

and an equation of similar form will express the relation of 
the probable error of the function to the probable errors of the 
quantities upon which it depends, whatever the number of 
these quantities may be. 
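The relation lends itself to a general routine. A minimal sketch, assuming the probable error of the function is the square root of the sum of the squared products of each differential coefficient by the corresponding probable error; the differential coefficients are here estimated by central differences, which is a convenience of this sketch and not a procedure given in the text. The function name and the sample figures are illustrative.

```python
import math


def propagate_probable_error(f, values, prob_errors, step=1e-6):
    """Return sqrt(sum((du/dx_i)^2 * r_i^2)) for u = f(x_1, ..., x_n)."""
    total = 0.0
    for i, (x, r) in enumerate(zip(values, prob_errors)):
        h = step * max(1.0, abs(x))
        up = list(values)
        down = list(values)
        up[i] = x + h
        down[i] = x - h
        dudx = (f(*up) - f(*down)) / (2.0 * h)   # central difference
        total += (dudx * r) ** 2
    return math.sqrt(total)


# Sum of two quantities with probable errors 3 and 4.
r_sum = propagate_probable_error(lambda a, b: a + b, [10.0, 20.0], [3.0, 4.0])

# Mean of four quantities of equal precision, each with probable error 2.
r_mean = propagate_probable_error(lambda a, b, c, d: (a + b + c + d) / 4.0,
                                  [5.0, 6.0, 7.0, 8.0], [2.0, 2.0, 2.0, 2.0])
print(round(r_sum, 6), round(r_mean, 6))
```

For the sum the routine returns the square root of 3² + 4², and for the mean of four equally precise quantities it returns one half the probable error of a single quantity.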

We proceed to apply this relation to a few simple cases of 
frequent occurrence in practice. 

(a) The probable error of the sum of n observed quantities.

In this case $u = x' + x'' + x''' + \cdots + x^{(n)}$
and each of the differential coefficients $\dfrac{\partial u}{\partial x'}$, $\dfrac{\partial u}{\partial x''}$, etc., equals 1;

whence

$$r^2 = r'^2 + r''^2 + r'''^2 + \cdots + r^{(n)2} \tag{28}$$

(b) The probable error of the mean of n observed quantities.
In this case

$$u = \frac{1}{n}\left(x' + x'' + x''' + \cdots + x^{(n)}\right) \qquad \frac{\partial u}{\partial x'} = \frac{\partial u}{\partial x''} = \cdots = \frac{1}{n}$$

We have here to distinguish two cases. If the x's are all of
equal precision, the r's are equal, and may be represented by a
common symbol $r_1$; whence

$$r = \frac{r_1}{\sqrt{n}} \tag{29}$$

If the observations are of unequal precision represented by
weights $p'$, $p''$, $p'''$, ..., $p^{(n)}$ we have

$$r = \frac{r_1}{\sqrt{[p]}} \tag{30}$$

where $r_1$ denotes the probable error of an observation whose
weight is 1.

The relations here derived between the probable error of a 
single determination of a quantity, and the probable error of 
the mean of n determinations, may be employed in connection 
with equation (23), to determine the probable error of an 
adopted value based upon several determinations of a quantity. 
Thus, in the general case of observations of unequal weight, if 
$r_1$ represent the probable error of an observation of weight 1,
and r the probable error of the weighted mean, we have from
equation (23),

$$r_1 = 0.674\sqrt{\frac{[pvv]}{n - 1}} \qquad r = \frac{r_1}{\sqrt{[p]}} \tag{31}$$
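As a worked illustration of the probable error of a weighted mean, the sketch below assumes that the probable error of an observation of weight 1 is $0.6745\sqrt{[pvv]/(n-1)}$ and that the probable error of the weighted mean is that quantity divided by $\sqrt{[p]}$. The observations, the weights, and the function name are hypothetical.

```python
import math


def weighted_mean_with_probable_error(values, weights):
    """Return (weighted mean, r_1 for unit weight, r of the weighted mean)."""
    n = len(values)
    if n < 2:
        raise ValueError("at least two observations are required")
    sum_p = sum(weights)
    mean = sum(p * x for p, x in zip(weights, values)) / sum_p
    pvv = sum(p * (x - mean) ** 2 for p, x in zip(weights, values))
    r1 = 0.6745 * math.sqrt(pvv / (n - 1))   # observation of weight 1
    r = r1 / math.sqrt(sum_p)                # weighted mean
    return mean, r1, r


# Hypothetical observations; the third is given twice the weight of the others.
mean, r1, r = weighted_mean_with_probable_error([10.0, 12.0, 11.0], [1.0, 1.0, 2.0])
print(mean, round(r1, 4), round(r, 4))
```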

Let the student show that when the observations are of 

equal precision and 



pV + ,iV+!,"V" + 


..p-rr- 


M 




dit ]i' du j>" 


etc. 






' * Vtri 
= (I (at' — x" — x'") >■ =s ± tt VS r"

. x' , x' r' 

= sin r =s ± cos — . - 



--e;^ 



i.x" x"*} 



§ 14. Assignment of Weights. Rejection of Observations.
The term weight has been employed in the preceding sections
as a measure of the quality of an observation, but its use is by
no means limited to the case of single observations. Thus,
from an Investigation of the Distance of the Sun, etc., by S.
Newcomb, we select the following determinations of the solar
parallax.

Method by which determined.                  Parallax.   Weight.

Meridian observations of Mars, 1862.
Micrometric observations of Mars, 1862.
Parallactic inequality of the Moon.
Lunar equation of the Earth.
Transit of Venus, 1769.

Each value of the parallax here given is the final result of
an elaborate discussion of many observations, and the weights
indicate the relative excellence attributed to these results by
the author of the investigation. If $\pi$ denote any one of these
values of the parallax, p its weight, and $\pi_0$ the most probable
value of the parallax, we shall have

$$\pi_0 = \frac{[p\pi]}{[p]} = 8''.847 \qquad (\S\ 6)$$

It is to be noted that this value depends upon the weights
assigned to the individual determinations, and that by properly
selecting the weights, $\pi_0$ may be made to assume any value
whatever between the least and the greatest single determination.
Thus if the weight 100 be assigned to the value 8".860,
and to each of the other values the weight 1, we shall find
$\pi_0$ = 8".859, while a weight 100 for the value 8".809 with a
weight 1 for each of the others, makes $\pi_0$ = 8".811. Between
these limits the value of $\pi_0$ depends upon the judgment of the
computer in assigning weights, and this determination of
weights is one of the most delicate questions that arise in the
application of the method of least squares.
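The dependence of the weighted mean upon the assigned weights is easily exhibited. In the sketch below only the extreme values 8".809 and 8".860 are quoted from the text; the three intermediate values are placeholders chosen for illustration, and the function name is likewise illustrative.

```python
def weighted_mean(values, weights):
    """Weighted mean [p x] / [p]."""
    return sum(p * x for p, x in zip(weights, values)) / sum(weights)


# Illustrative parallaxes in seconds of arc. Only 8.809 and 8.860 are taken
# from the text; the three middle values are placeholders.
parallaxes = [8.855, 8.842, 8.838, 8.809, 8.860]

favour_largest = weighted_mean(parallaxes, [1, 1, 1, 1, 100])
favour_smallest = weighted_mean(parallaxes, [1, 1, 1, 100, 1])
equal_weights = weighted_mean(parallaxes, [1, 1, 1, 1, 1])
print(round(favour_largest, 3), round(favour_smallest, 3), round(equal_weights, 3))
```

Whatever positive weights are chosen, the result lies between the least and the greatest single value, and a preponderant weight draws it close to the value so favoured.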

A relation between weights and probable errors may easily 
be established, which is frequently of service in that it enables 
the problem of weights to be stated in a different form. Let x
denote an observation whose probable error is r and whose
weight is 1, and denote by $x'$, $x''$, $x'''$, etc., observations or
combinations of observations of the same quantity, whose weights
and probable errors are represented by $p'$, $p''$, $p'''$, $r'$, $r''$, $r'''$,
etc. In accordance with the definition of weights, $x'$ is the
equivalent of $p'$ observations of the same quality as x, and
from the equations derived in the preceding section we have

$$r' = \frac{r}{\sqrt{p'}}$$

with a similar expression for each of the other quantities;
whence

$$p'r'^2 = p''r''^2 = p'''r'''^2 = r^2$$

and

$$p' = \frac{r^2}{r'^2} \qquad p'' = \frac{r^2}{r''^2} \qquad p''' = \frac{r^2}{r'''^2} \tag{32}$$

and, in general, the weights are inversely proportional to the
squares of the probable errors.

It has been sufficiently shown that probable errors derived 
from the residuals furnished by a series of observations repre- 
sent only the effects of accidental errors of observation, but we 
may extend the significance of the term so as to include an 
estimate of the effect upon $x'$, $x''$, $x'''$ of systematic errors in
the observations. Let $r_1$ and $r_2$ represent those parts of the
probable error which come from these two sources respectively,
and from § 13 we find for their combined effect

$$r'^2 = r_1^2 + r_2^2$$

and the expression for the weights becomes

$$p' = \frac{r^2}{r_1^2 + r_2^2} \tag{33}$$
By this device the determination of weights is reduced to an
estimate of the combined effect of accidental and systematic
errors of observation upon the quantity whose weight is
desired, and it was from an estimate of this character that the
weights of the parallaxes given above were derived.

If $r'$ denote the probable accidental error of a single observation,
and the quantity whose weight is p is the mean of n
such observations, we shall have

$$p = \frac{n}{r'^2 + n\,r_2^2}$$

from which all constant factors have been dropped, since
only relative values of p are ever required. It appears from
this equation that if the systematic errors, $r_2$, are very small
compared with the accidental errors, $r'$, the weight increases
rapidly as the number of observations is increased, but if the
systematic errors are large, the weight is but little affected by
the number of observations; a relation to be considered in
deciding how many observations shall be made to determine
an unknown quantity.
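The effect of the number of observations upon the weight can be tabulated directly. A minimal sketch, assuming the relative weight of a mean of n observations is n divided by the sum of the squared accidental error of one observation and n times the squared systematic error; the numerical values and the function name are hypothetical.

```python
def relative_weight(n, r_accidental, r_systematic):
    """Relative weight of the mean of n observations: n / (r'^2 + n r_2^2)."""
    return n / (r_accidental ** 2 + n * r_systematic ** 2)


# No systematic error: the weight grows in proportion to n.
free = [relative_weight(n, 2.0, 0.0) for n in (4, 8, 16)]

# Systematic error of 1: the weight can never reach 1 / 1**2, however large n.
bound = [relative_weight(n, 2.0, 1.0) for n in (4, 100, 10000)]
print(free)
print([round(p, 4) for p in bound])
```

In the second case a hundredfold increase in the number of observations, from 100 to 10000, raises the weight by only about four per cent, which is the consideration urged above in deciding how many observations to make.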

In some cases it may be impossible to form any reliable 
estimate of the effect of systematic errors, and results which 
have been derived by different methods, or under different cir- 
cumstances, may then be given equal weights on the supposition 
that they are affected by different systematic errors which it 
is equally important to eliminate ; but this is equivalent to 
putting $r_2 = \infty$ in the equation for the weights, and it will
rarely happen that this is the best estimate which can be made 
for the amount of the systematic errors. 

It frequently happens that in a series of otherwise accordant 
observations, one or two will be found which differ widely from 
the others, and which if included in the final result, will 
furnish large residuals. What shall be done with observations 
of this kind has long been a vexed question. To reject them 
is equivalent to assigning to them the weight 0, and is the
expression of the computer's judgment that they can contribute
nothing to the accuracy of the result which he seeks to obtain.
In an infinitely long series of observations, errors of any finite
magnitude may be permitted without impairing the accuracy
of the final result, and the existence of such errors seems
contemplated by the theory which we have adopted, since the
equation of the error curve gives finite values of y for all
values of x between the limits $-\infty$ and $+\infty$. But in the
actual case which arises in practice where a result must be
obtained from a comparatively small number of observations,
a single one of these, if affected with a large error, may make
the final result farther from the truth than any one of the
other observations. On the other hand, cases are by no means
unknown in which a single discordant observation in a series
proves to be nearest to the true value of the quantity sought, 
the others having all been vitiated by some common cause ; 
and between these extremes an infinite variety of cases may 
be found. It must in general remain a matter of doubt 
whether a given discordant observation should or should not 
be rejected, and the decision made by the computer must be 
his judgment based upon all the data available as to whether 
more will be gained by rejecting than by retaining it. A knowl- 
edge of the way in which observations are made, of the circum- 
stances attending the particular observation in question, the 
magnitude of the errors which may reasonably be expected 
with the given observer and apparatus, or instrument, are 
elements which should be included in this judgment ; and the 
observer will greatly facilitate its formation by making copious 
notes at the time of observation of all circumstances which in 
his opinion may affect the quality of his work, and particularly 
by noting any abnormal circumstances affecting a single obser- 
vation or a part of the observations. 

A doubtful observation should be rejected if it is the com- 
puter's deliberate judgment that its retention will hurt more 
than it will help his final result, but it is never legitimate for
the computer to suppress an observation. A rejected observation
should be included in the statement of his data, and may
properly be accompanied by an explanation of the reasons for
its rejection, in order that any person interested in the result
may form his own judgment of the data and the manner in 
which they have been discussed, and may, if necessary, re- 
discuss the observations in accordance with that judgment. 

The conclusion of the whole matter of assigning weights to 
numerical data may be summed up in the statement that no 
mathematical expression will suffice for this purpose, but the 
weights must be determined by an exercise of personal judg- 
ment, and the wider the knowledge upon which this judgment 
is based, the greater confidence will the weights and the result- 
ing values of the unknown quantities command. 

§ 15. Empirical or Interpolation Formulæ. In the preceding
sections attention has been directed to that class of problems 
in which the theoretical relation between the observed quanti- 
ties and those whose values are to be determined is known ; 
that is, an equation of known form exists between them, and 
the problem has been to determine the values of the constants 
which appear in the equation. But a very different class of 
cases now demands a passing notice. 

A series of observations is sometimes found to be affected 
with errors too great to be explained as the result of unavoid- 
able and fortuitous causes, and it becomes apparent that the 
law of recurrence of these errors must be determined before 
the observations can be made to yield any valuable results. 
The American parties which were sent out in 1874 to observe 
the transit of Venus were provided with instruments for the 
determination of their local time, of such a character that the 
accidental error of a determination from a single star might 
fairly be estimated at 0ˢ.05 or 0ˢ.06, but results obtained from 
observations of different stars varied among themselves by 
more than ten times this amount. An inspection of the dis- 
crepancies having shown that they depended in some way upon 
the distance of the observed star from the zenith, it was found 
by trial that the error at any zenith distance, z, could be repre- 
sented by the expression 

\[ E = \pm\,[\,a\cos z - b\sin 2z\,] \]

where a and b are constants whose values were found from the 
observations themselves. The physical cause of these errors 
was subsequently found to be the bending of the instrument 
under its own weight, but it is to be noted that the above law 
of recurrence of the errors was determined first, the cause 
afterwards. 
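
The determination of a and b "from the observations themselves" can be sketched as an ordinary least-squares fit of the expression above. The sketch below is only an illustration: the zenith distances and errors are hypothetical values invented for the purpose (the 1874 observations are not reproduced in the text), the positive sign of E is assumed, and the function name `fit_flexure` is not from the source.

```python
import math

def fit_flexure(zenith_deg, errors):
    """Least-squares values of a, b in E = a*cos(z) - b*sin(2z)."""
    # Each observation gives one linear equation u*a + v*b = E,
    # with u = cos z and v = -sin 2z; the sums below are the
    # coefficients of the two normal equations.
    suu = suv = svv = sue = sve = 0.0
    for z_deg, e in zip(zenith_deg, errors):
        z = math.radians(z_deg)
        u, v = math.cos(z), -math.sin(2.0 * z)
        suu += u * u
        suv += u * v
        svv += v * v
        sue += u * e
        sve += v * e
    det = suu * svv - suv * suv
    a = (sue * svv - sve * suv) / det
    b = (suu * sve - suv * sue) / det
    return a, b

# Hypothetical, error-free data generated from assumed constants.
true_a, true_b = 0.30, 0.45
zs = [10.0, 25.0, 40.0, 55.0, 70.0]
es = [true_a * math.cos(math.radians(z)) - true_b * math.sin(math.radians(2 * z))
      for z in zs]
a, b = fit_flexure(zs, es)
print(round(a, 6), round(b, 6))
```

With error-free data the assumed constants are recovered; with real observations the residuals of the fit would show whether the chosen form is adequate.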

Expressions of this kind are sometimes called interpolation 
formulae and sometimes empirical equations; the one term having 
reference to their use, the other to their derivation. They 
are of very general use in all branches of physical science, 
since they may be made to serve as a convenient summary of a 
vast amount of numerical data, and one of the most important 
applications of the method of least squares is in determining 
the values of the constants which enter into such expressions. 
The problems treated in §§ 1 and 10 both belong to this class, 
and the following expression for the magnetic declination at 
Washington, D.C., derived by Mr. C. A. Schott* from a series 
of observations extending over ninety years may serve as a 
further illustration : 

\[ \text{Mag. Dec.} = 2^\circ.47 + 2^\circ.50\,\sin\,[\,1^\circ.40\,(T - 1850) - 14^\circ.6\,] \]
where T denotes the year for which the declination is required. 
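
As a convenience to the computer, the formula may be evaluated mechanically. The following is a minimal sketch; the function name `magnetic_declination` is an assumption introduced here, and the constants are those of the expression above.

```python
import math

def magnetic_declination(year):
    """Declination at Washington, in degrees, from the empirical formula."""
    # The argument of the sine is expressed in degrees in the text,
    # so it is converted to radians before the sine is taken.
    argument_deg = 1.40 * (year - 1850) - 14.6
    return 2.47 + 2.50 * math.sin(math.radians(argument_deg))

for year in (1800, 1850, 1880):
    print(year, round(magnetic_declination(year), 2))
```

Since the argument advances 1°.40 per year, the expression repeats itself after 360/1.40, or about 257 years; like any interpolation formula it should not be trusted far beyond the ninety years covered by the observations.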

When the cause whose effects are to be represented by an 
equation is known, the form of this equation can usually be 
derived by mathematical analysis ; but where empirical formulae 
are employed other methods must be resorted to. The simplest 
of these is a graphical representation of the errors or other 
data under consideration. For this purpose let the errors 
represent ordinates, and the values of any variable upon which 
they are supposed to depend, the corresponding abscissas. Let 
points be plotted with these ordinates and abscissas as was 
done in obtaining the form of the error curve, Figs. A, B, C, D, 
and let a smooth curve be drawn through these points either 
free-hand or by the aid of a draughtsman's "irregular curve." 
The distance of each plotted point from the curve, measured 
along an ordinate, is the residual corresponding to the point, 

* U.S. C. & G. S. Report, 1882, p. 258. 
and in accordance with the principle of least squares the curve 
should be so drawn as to make the sum of the squares of these 
residuals as small as possible, without unduly complicating the 
curve. If the variable has been properly chosen it will in 
many cases be found possible to draw a simple curve which 
shall represent the data within the limits of the accidental 
errors of observation, and as this curve is the graphical 
representation of the law required, its equation, y = f(x), is the 
analytical representation of that law. In this manner the 
form of the equation treated in § 10 was obtained. 

In other cases it will be possible to draw a smooth and sim- 
ple curve which shall not represent the data within the limits 
of accidental error of the observations, but about which the 
points will be grouped, alternating from one side to the other 
in a systematic manner. Let the excess of the ordinate of any 
point over the corresponding ordinate of the curve be plotted 
with the given abscissa in a new curve. The two curves thus 
constructed will together form the graphical representation of 
the law of the data, and the analytic expression of that law 
will be 

\[ y = f(x) + \phi(x) \]

if y = f(x) and y = φ(x) are the equations of the two curves 
respectively. 

In some cases the curves themselves will be a sufficient 
representation of the data, and it will be unnecessary to determine 
their equations since the value of y corresponding to any 
given value of x may be obtained by direct measurement. In 
other cases the curve will be chiefly serviceable in suggesting 
the probable form of an equation between the observed quan- 
tity and a variable upon which it is supposed to depend, or in 
showing that no simple relation exists between them. Two 
forms of equation are of such frequent use in this connection 
that they deserve especial notice. 

If the plotted curve does not differ very greatly from a 
straight line, the relation of the variable x to its function y 
may be represented by 

\[ y = a + bx + cx^2 + dx^3 + \text{etc.} \qquad (35) \]

This equation contains the first few terms of an infinite 
series by which a limited arc of any continuous curve can be 
represented, and since the actual relation between y and x 
could be represented by a curve, if its mathematical expression 
were known, it follows that the above equation can be made to 
represent this relation over a certain range of values of x, by 
assigning proper values to the coefficients. The number of 
terms of this series which should be taken into account, and 
the limits of x beyond which the equation is not applicable, 
depend upon the actual relation between y and x, and are 
therefore unknown ; but, in general, it is not well to attempt to 
use this equation for large values of x, or when more than 
three or at most four terms are required. Its application in a 
simple case is illustrated in the problem of § 1, where y and x 
being replaced by the length of the bar and its temperature, it 
is assumed that their relation can be expressed within the 
range of temperature over which the observations extend, by 
the first two terms of the series. 
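
When only the first two terms of equation (35) are retained, the least-squares values of a and b follow from two normal equations. The sketch below illustrates this under stated assumptions: the temperatures and lengths are hypothetical numbers, not the data of § 1, and the name `fit_line` is introduced here for illustration only.

```python
def fit_line(xs, ys):
    """Least-squares a, b in y = a + b*x for equally weighted observations."""
    n = len(xs)
    sx = sum(xs)
    sy = sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    # Solution of the two normal equations
    #   n*a  + sx*b  = sy
    #   sx*a + sxx*b = sxy
    b = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    a = (sy - b * sx) / n
    return a, b

# Hypothetical bar: length 1000.0 at zero temperature, growing
# 0.012 units per degree (assumed values, free from error).
temperatures = [0.0, 10.0, 20.0, 30.0]
lengths = [1000.0 + 0.012 * t for t in temperatures]
a, b = fit_line(temperatures, lengths)
print(round(a, 6), round(b, 6))
```

The fitted line should be used only within the range of temperature covered by the observations, in keeping with the caution given above.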

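The second type of equation above referred to is 

\[ y = a_0 + a_1\cos\frac{x}{m} + a_2\cos\frac{2x}{m} + a_3\cos\frac{3x}{m} + \text{etc.} + b_1\sin\frac{x}{m} + b_2\sin\frac{2x}{m} + b_3\sin\frac{3x}{m} + \text{etc.} \qquad (36) \]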



in which m is an undetermined constant expressed in the same 
unit as x; x/m is therefore a ratio, or absolute number, which 
in the application of the equation to numerical data must 
be transformed into circular measure by multiplying it by 
180°/π = 57°.29578. This form of equation may be made to 
represent any relation whatever between finite values of y and 
x, including those cases in which y is a discontinuous function, 
but it is especially advantageous when y is a periodic function 
of x, i.e., one in which the same values of y recur for values of 
x separated by a constant interval, τ, called the period of x, 
so that 

\[ f(x) = f(x + \tau) = f(x + 2\tau) = \cdots = f(x + n\tau). \]

The simplest type of such a function is y = sin x, the period 
in this case being τ = 360° ; e.g., 

\[ \sin 10^\circ = \sin(10^\circ + 360^\circ) = \sin(10^\circ + 720^\circ) = \text{etc.} \]

When y is such a function the constant m should be put equal 
to the period divided by 2π, m = τ/2π; in other cases the value 
of m should be so taken that the greatest value of x/m included 
among the data shall not exceed π. The application of this 
formula may often be facilitated by noting that if the relation 
between y and x is such that f(+x) = f(−x), the sine terms 
all disappear, and the equation reduces to 

\[ y = a_0 + a_1\cos\frac{x}{m} + a_2\cos\frac{2x}{m} + \text{etc.} \qquad (37) \]

while if f(x) = −f(−x), the cosine terms vanish, and the 
equation becomes 

\[ y = a_0 + b_1\sin\frac{x}{m} + b_2\sin\frac{2x}{m} + \text{etc.} \qquad (38) \]
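
The disappearance of the sine terms for a symmetrical function, and of the cosine terms for an anti-symmetrical one, may be checked numerically. The sketch below is an illustration only: it estimates the coefficients from equally spaced values over one period, which is a convenient device and not the method of the text, and the test functions and the name `harmonic_coefficients` are assumptions introduced here.

```python
import math

def harmonic_coefficients(f, period, order, samples=360):
    """Estimate a_0..a_order and b_1..b_order from equally spaced values."""
    m = period / (2.0 * math.pi)          # m = period / 2*pi, as in the text
    xs = [period * i / samples for i in range(samples)]
    a = [sum(f(x) for x in xs) / samples]  # a_0 is the mean value
    b = [0.0]                              # placeholder so that b[k] is b_k
    for k in range(1, order + 1):
        a.append(2.0 * sum(f(x) * math.cos(k * x / m) for x in xs) / samples)
        b.append(2.0 * sum(f(x) * math.sin(k * x / m) for x in xs) / samples)
    return a, b

# Assumed test functions of period 360 (x in degrees).
def even_f(x):
    return 3.0 + math.cos(math.radians(x)) + 0.5 * math.cos(math.radians(2 * x))

def odd_f(x):
    return 2.0 * math.sin(math.radians(x))

a_even, b_even = harmonic_coefficients(even_f, 360.0, 2)
a_odd, b_odd = harmonic_coefficients(odd_f, 360.0, 2)
print([round(v, 6) for v in a_even], [round(v, 6) for v in b_even])
print([round(v, 6) for v in a_odd], [round(v, 6) for v in b_odd])
```

For the symmetrical function every b comes out sensibly zero, and for the anti-symmetrical one every a, so that in such cases only half of the coefficients need be carried as unknowns.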

The several forms above given to this type of equation are 
those most convenient for use when the values of the coefficients 
a, b, etc., are to be determined, but after their numerical 
values have been found it is advantageous to transform the 
equation as follows : Introduce the auxiliary quantities 

\[ n_0,\ n_1,\ n_2,\ N_1,\ N_2,\ \text{etc.} \]

defined by the relations 

\[ n_0 = a_0 \qquad n_1\cos N_1 = a_1 \qquad n_2\cos N_2 = a_2 \]
\[ \phantom{n_0 = a_0} \qquad n_1\sin N_1 = b_1 \qquad n_2\sin N_2 = b_2 \]

and the equation becomes 

\[ y = n_0 + n_1\cos\left(\frac{x}{m} - N_1\right) + n_2\cos\left(\frac{2x}{m} - N_2\right) + \text{etc.} \qquad (39) \]

each pair of terms of the original equation being here replaced 
by a single term. The expression for the magnetic declination 
at Washington given on p. 59 is of this type, as may be seen 
by writing it in the equivalent form 

\[ \text{Mag. Dec.} = 2^\circ.47 + 2^\circ.50\,\cos\,[\,1^\circ.40\,(T - 1850) - 104^\circ.6\,] \]
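
The passage from a pair of coefficients a, b to the auxiliary quantities n, N of equation (39) can be sketched as follows. The function name `to_amplitude_phase` is an assumption; the check uses the declination formula, whose sine term 2°.50 sin(θ − 14°.6) is first expanded into its cosine and sine parts.

```python
import math

def to_amplitude_phase(a, b):
    """Return (n, N in degrees) such that n*cos N = a and n*sin N = b."""
    n = math.hypot(a, b)
    # atan2 places N in the proper quadrant from the signs of b and a.
    N = math.degrees(math.atan2(b, a))
    return n, N

# 2.50*sin(theta - 14.6 deg) = a*cos(theta) + b*sin(theta), with
phi = math.radians(14.6)
a = -2.50 * math.sin(phi)
b = 2.50 * math.cos(phi)

n, N = to_amplitude_phase(a, b)
print(round(n, 2), round(N, 1))
```

The result, n = 2.50 and N = 104°.6, agrees with the cosine form of the declination formula. This sketch always returns a positive n; an equivalent form with n negative and N altered by 180° is equally admissible.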

The mode of applying this form of equation may be illus- 
trated by means of the following data selected from the series 
of observations whose residuals are plotted in Fig. C. The 
observed quantity, B, is the difference of stellar magnitude 
(brightness) between the planet Saturn and his satellite Iapetus. 
The quantity l given with each observed B fixes the 
position of the satellite in its orbit at the time of observation, 
and is analogous to the variable angle θ in a system of polar 
coordinates. 



B is here seen to run through a complete cycle of values 
between the limits 9.87 and 11.81, while l varies from 0° to 
360°. We shall therefore endeavor to represent B as a periodic 
function of l whose period is 360°. In accordance with this 
assumption we put 

\[ m = \frac{360^\circ}{2\pi}; \qquad \frac{x}{m}\cdot\frac{180^\circ}{\pi} = l; \qquad y = B \]

and taking into account the first five terms of the series, the 
several observations furnish the following 

Observation Equations. 

10.82 = a_0 + 0.98 a_1 + 0.17 b_1 + 0.94 a_2 + 0.34 b_2 
11.81 = a_0 + 0.34 a_1 + 0.94 b_1 − 0.77 a_2 + 0.64 b_2 
11.69 = a_0 − 0.34 a_1 + 0.94 b_1 − 0.77 a_2 − 0.64 b_2 
11.42 = a_0 − 0.77 a_1 + 0.64 b_1 + 0.17 a_2 − 0.98 b_2 
10.66 = a_0 − 0.94 a_1 − 0.34 b_1 + 0.77 a_2 + 0.64 b_2 
 9.87 = a_0 − 0.64 a_1 − 0.77 b_1 − 0.17 a_2 + 0.98 b_2 
10.43 = a_0 + 0.00 a_1 − 1.00 b_1 − 1.00 a_2 + 0.00 b_2 
10.48 = a_0 + 0.64 a_1 − 0.77 b_1 − 0.17 a_2 − 0.98 b_2 

The solution of these equations will be found in the following 
section. The values obtained for the constants are 



Introducing the constants n, N, and determining their values 
from the relations 

n_0 = 10.92     n_1 cos N_1 = + 0.15     n_2 cos N_2 = − 0.04 
                n_1 sin N_1 = + 0.74     n_2 sin N_2 = − 0.18 

the equation becomes 

\[ B = 10.92 + 0.76\cos(l - 78^\circ.5) - 0.18\cos(2l - 77^\circ.5) \]

The residuals obtained by comparing the values of B computed 
from this formula with the observed values are given above 
with the data. 
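
The comparison of computed with observed values may be sketched as below. The coefficients are those of the observation equations and the constants are the adopted values a_0 = 10.92, a_1 = +0.15, b_1 = +0.74, a_2 = −0.04, b_2 = −0.18; the convention "observed minus computed" for the residual is an assumption made here, and the numbers produced are an illustration, not a reproduction of the residuals printed with the data.

```python
# Each row: observed B, then the coefficients of a_1, b_1, a_2, b_2
# as printed in the observation equations.
ROWS = [
    (10.82,  0.98,  0.17,  0.94,  0.34),
    (11.81,  0.34,  0.94, -0.77,  0.64),
    (11.69, -0.34,  0.94, -0.77, -0.64),
    (11.42, -0.77,  0.64,  0.17, -0.98),
    (10.66, -0.94, -0.34,  0.77,  0.64),
    ( 9.87, -0.64, -0.77, -0.17,  0.98),
    (10.43,  0.00, -1.00, -1.00,  0.00),
    (10.48,  0.64, -0.77, -0.17, -0.98),
]

A0, A1, B1, A2, B2 = 10.92, 0.15, 0.74, -0.04, -0.18

def computed_value(row):
    """Value of B given by the formula for one observation."""
    _, c1, s1, c2, s2 = row
    return A0 + A1 * c1 + B1 * s1 + A2 * c2 + B2 * s2

computed = [computed_value(row) for row in ROWS]
# Residual taken as observed minus computed (an assumed convention).
residuals = [row[0] - value for row, value in zip(ROWS, computed)]

for row, value, r in zip(ROWS, computed, residuals):
    print(f"{row[0]:6.2f} {value:7.3f} {r:+7.3f}")
```

Working from the amplitude and phase form instead would give the same computed values to within the rounding of the constants.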

Abundant data for exercise in deriving empirical formulae of 
this kind may be found in the United States Coast and Geodetic 
Survey Report for 1882, pp. 218-257. 

§ 16. Approximate Solutions. It is often desirable to obtain 
from a series of observations, as rapidly and with as little labor 
as possible, a set of values of the unknown quantities involved 
which shall be fair approximations to their most probable val- 
ues, but in which the highest degree of accuracy is not required. 
In cases of this kind the least square treatment of the obser- 
vation equations as illustrated in § 10 is too long and laborious, 
and the following method may often be substituted for it with 
advantage. 

Let there be any number, e.g., three, unknown quantities involved 
in a set of observation equations of the form 

\[ ax + by + cz + n = 0 \qquad \text{Weight} = p \]

and let each of these equations be multiplied by its weight, 
giving the group of equations, 

\[ a_1 x + b_1 y + c_1 z + n_1 = 0 \qquad k_1 \]
\[ \text{etc.} \qquad \text{etc.} \qquad \text{etc.} \qquad \text{etc.} \qquad (40) \]

Multiply each of these equations by the undetermined constant 
k placed opposite it, and let the sum of all the resulting 
equations be formed. By the use of the summation 
symbol, [ ], this sum may be written 

\[ [ka]x + [kb]y + [kc]z + [kn] = 0 \]

Since the several values of k which enter into this equation 
are entirely arbitrary it would be permissible to assign to them 
such values that [kb] and [kc] should each equal 0, which 
would give at once 

\[ x = -\frac{[kn]}{[ka]} \qquad (41) \]

This, however, is not practically advantageous on account of 
the labor involved in determining the values of k. We therefore 
put 

\[ x = -\frac{[kn]}{[ka]} - \frac{[kb]}{[ka]}\,y - \frac{[kc]}{[ka]}\,z \qquad (42) \]

and, limiting the values of k to +1 and −1, assign them in 
such a manner that [ka] shall be made as great, and [kb], 
[kc] as small as possible. In this manner the coefficients of 
y and z may often be made so small that if approximate values 
of y and z are substituted in equation (42), they will furnish 
a sufficiently accurate value of x; since the effect of the errors 
of these approximations will be much diminished by the small 
coefficients by which they are multiplied. 

The value of y may be found in the same manner by selecting 
a set of k's which shall make [kb] large and [ka], [kc] 
small, and similarly for z. Two or three trials may be required 
before sufficiently close approximations to the values of x, y, z 
are obtained, but these trials are rapidly and easily made, and, 
if necessary, in exceptional cases the summation equations 
may be written in the form 

\[ [k'a]x + [k'b]y + [k'c]z + [k'n] = 0 \]
\[ [k''a]x + [k''b]y + [k''c]z + [k''n] = 0 \qquad (43) \]
\[ [k'''a]x + [k'''b]y + [k'''c]z + [k'''n] = 0 \]

and the equations solved by any of the methods of elementary 
algebra, but in every case the values of k, +1 and −1, should 
be so chosen that in the first equation the coefficient of x, in 
the second equation the coefficient of y, and in the third equation 
the coefficient of z, shall be made as large, and all of the 
other coefficients as small as possible. 

By this mode of solution each observation with its proper 
weight is included in the determination of the unknowns, but 
since the principle of least squares has not been taken into 
account, it cannot be expected that the resulting values will 
be the best that the observations can be made to yield. 

To illustrate the mode of solution we recur to the observa- 
tion equations contained in the preceding section and putting 
a_0 = 10.00 + a write them as follows, placing the several values 
of k at the right of each equation. 



               cos l        sin l        cos 2l       sin 2l         k'   k''  k''' k^iv k^v 
  0.82 = a + 0.98 a_1 + 0.17 b_1 + 0.94 a_2 + 0.34 b_2     +1   +1   +1   +1   +1 
  1.81 = a + 0.34 a_1 + 0.94 b_1 − 0.77 a_2 + 0.64 b_2     +1   +1   +1   −1   +1 
  1.69 = a − 0.34 a_1 + 0.94 b_1 − 0.77 a_2 − 0.64 b_2     +1   −1   +1   −1   −1 
  1.42 = a − 0.77 a_1 + 0.64 b_1 + 0.17 a_2 − 0.98 b_2     +1   −1   +1   +1   −1 
  0.66 = a − 0.94 a_1 − 0.34 b_1 + 0.77 a_2 + 0.64 b_2     +1   −1   −1   +1   +1 
− 0.13 = a − 0.64 a_1 − 0.77 b_1 − 0.17 a_2 + 0.98 b_2     +1   −1   −1   −1   +1 
  0.43 = a + 0.00 a_1 − 1.00 b_1 − 1.00 a_2 + 0.00 b_2     +1   +1   −1   −1   −1 
  0.48 = a + 0.64 a_1 − 0.77 b_1 − 0.17 a_2 − 0.98 b_2     +1   +1   −1   +1   −1 

The summation equations obtained from this group are : 

+ 7.18 = 8a − 0.73 a_1 − 0.19 b_1 − 1.00 a_2 − 0.00 b_2 
− 0.10 = 0a + 4.65 a_1 − 1.13 b_1 − 1.00 a_2 
+ 4.30 = 0a + 1.15 a_1 + 5.57 b_1 + 0.14 a_2 
− 0.42 = 0a + 0.53 a_1 − 0.41 b_1 + 4.42 a_2 
− 0.86 = 0a + 0.21 a_1 + 0.19 b_1 + 2.54 a_2 


and these correspond to the normal equations of a least square 
solution. To apply the method of approximations to the solution 
of this group of equations, we write them in the form : 

a   = + 0.897 + 0.091 a_1 + 0.024 b_1 + 0.125 a_2 
a_1 = − 0.022 + 0.000 a_1 + 0.243 b_1 + 0.215 a_2 
b_1 = + 0.772 − 0.206 a_1 + 0.000 b_1 − 0.025 a_2        (45) 
a_2 = − 0.095 − 0.120 a_1 + 0.093 b_1 + 0.000 a_2 
b_2 = − 0.166 − 0.040 a_1 − 0.036 b_1 − 0.488 a_2 

The divisions required in making this transformation were 
made by the use of Crelle's Rechentafeln. 
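
The formation of one summation equation, and its reduction to the form (45), can be sketched as follows. The sketch treats only the set k'' belonging to a_1; the numbers are the observation equations and k's of the text, while the function names `summation` and `solve_for` are assumptions introduced for illustration.

```python
# Each row: absolute term, then coefficients of a, a_1, b_1, a_2, b_2.
ROWS = [
    ( 0.82, 1,  0.98,  0.17,  0.94,  0.34),
    ( 1.81, 1,  0.34,  0.94, -0.77,  0.64),
    ( 1.69, 1, -0.34,  0.94, -0.77, -0.64),
    ( 1.42, 1, -0.77,  0.64,  0.17, -0.98),
    ( 0.66, 1, -0.94, -0.34,  0.77,  0.64),
    (-0.13, 1, -0.64, -0.77, -0.17,  0.98),
    ( 0.43, 1,  0.00, -1.00, -1.00,  0.00),
    ( 0.48, 1,  0.64, -0.77, -0.17, -0.98),
]

# The column k'' of the text: +1 where the coefficient of a_1 is
# positive (or zero), -1 where it is negative.
K2 = [+1, +1, -1, -1, -1, -1, +1, +1]

def summation(rows, ks):
    """Multiply each equation by its k and add the results column by column."""
    width = len(rows[0])
    return [sum(k * row[j] for k, row in zip(ks, rows)) for j in range(width)]

def solve_for(total, index):
    """Express the unknown in column `index` in terms of the others."""
    pivot = total[index]
    constant = total[0] / pivot
    coefficients = [-c / pivot for c in total[1:]]
    coefficients[index - 1] = 0.0   # the unknown itself does not appear
    return constant, coefficients

total = summation(ROWS, K2)
constant, coefficients = solve_for(total, 2)
print([round(v, 2) for v in total])
print(round(constant, 3), [round(v, 3) for v in coefficients])
```

The sums reproduce the second summation equation, −0.10 = 0a + 4.65 a_1 − 1.13 b_1 − 1.00 a_2, and the division reproduces the second of equations (45).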

By operations which can be performed mentally, we obtain 
the following sets of approximations to the values of a_1, b_1, a_2 : 

          I.      II.     III. 
a_1      0.0    +0.2    +0.15 
b_1     +0.8    +0.7    +0.74 
a_2     −0.1    −0.0    −0.04 

and substituting in equations (45) the values given under III 
we find the adopted values 

a_0 = 10 + a = 10.92     a_1 = + 0.15     a_2 = − 0.04 
                         b_1 = + 0.74     b_2 = − 0.18 
which were employed in the preceding section. 
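
The successive approximations can also be carried out mechanically. The sketch below repeats the substitution in equations (45) until the values settle, each new value being used as soon as it is found; this order of working and the name `iterate_45` are assumptions made here, the text stating only that the operations were performed mentally.

```python
def iterate_45(sweeps=20):
    """Successive substitution in equations (45), starting from zero."""
    a1 = b1 = a2 = 0.0
    for _ in range(sweeps):
        # The three mutually dependent unknowns are improved in turn.
        a1 = -0.022 + 0.243 * b1 + 0.215 * a2
        b1 = 0.772 - 0.206 * a1 - 0.025 * a2
        a2 = -0.095 - 0.120 * a1 + 0.093 * b1
    # The remaining two follow by direct substitution.
    a = 0.897 + 0.091 * a1 + 0.024 * b1 + 0.125 * a2
    b2 = -0.166 - 0.040 * a1 - 0.036 * b1 - 0.488 * a2
    return 10.0 + a, a1, b1, a2, b2

a0, a1, b1, a2, b2 = iterate_45()
print(round(a0, 2), round(a1, 2), round(b1, 2), round(a2, 2), round(b2, 2))
```

Because the coefficients on the right of (45) are small, two or three sweeps already give the second decimal place, which is why the mental computation converges so quickly.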
y= -— e-*'-'' §3,4 

i=i? ^ ^ 

-w ^^ 

[».]» + [a!,]j, + M. + [,„.] = 5 7 

[a6]=l![(o + 6)']-([<K.] + [»])j 5 SI 

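\[ [as] = [aa] + [ab] + [ac] + \cdots + [an] \qquad \S\,9 \]

\[ [bc.1] = [bc] - \frac{[ab]}{[aa]}\,[ac] \qquad \S\,9 \]

\[ [cc.2] = [cc.1] - \frac{[bc.1]}{[bb.1]}\,[bc.1] \qquad \S\,9 \]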

[«.]l-,+[.fl», + *(»S,»,!i)-9 sll 

r.»-4II- ±0.674 JS:=±„.845a+d=. . 5 12 

'■"v^^vis! * 

/>';i)":i)"' = — : — r— S 14 

\[ y = a + bx + cx^2 + \text{etc.} \qquad \S\,15 \]

\[ y = n_0 + n_1\cos\left(\frac{x}{m} - N_1\right) + n_2\cos\left(\frac{2x}{m} - N_2\right) + \text{etc.} \qquad \S\,15 \]